ON FINITE RANK DEFORMATIONS OF WIGNER MATRICES 
II: DELOCALIZED PERTURBATIONS 

DAVID RENFREW AND ALEXANDER SOSHNIKOV 



Abstract. We study the distribution of the outliers in the spectrum of finite 
rank deformations of Wigner random matrices. We assume that the matrix 
entries have finite fourth moment and extend the results by Capitaine, Donati-Martin, 
and Féral for perturbations whose eigenvectors are delocalized. 



1. Introduction 



In this paper we continue the study of the eigenvalues of finite rank deformations 
of Wigner random matrices, extending the results of [44] to a larger class of 
perturbations. 



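Let $X_N := \frac{1}{\sqrt{N}} W_N$ be a random Wigner real symmetric (Hermitian) matrix. In 
the real symmetric case, we assume that the entries 

\[ (W_N)_{jk}, \quad 1 \le j \le k \le N, \tag{1.1} \]

are independent random variables such that the off-diagonal entries satisfy 

\[ \mathbb{E}[(W_N)_{jk}] = 0, \quad \mathbb{V}[(W_N)_{jk}] = \sigma^2, \quad 1 \le j < k \le N, \quad m_4 := \sup_{j \ne k,\, N} \mathbb{E}[(W_N)_{jk}^4] < \infty, \]

and the Lindeberg type condition for the fourth moments takes place, 

\[ L_N(\epsilon) \to 0 \ \text{ as } N \to \infty, \quad \forall \epsilon > 0, \tag{1.2} \]

where 

\[ L_N(\epsilon) = \frac{1}{N^2} \sum_{1 \le i < j \le N} \mathbb{E}\left( |(W_N)_{ij}|^4 \, \mathbf{1}_{\{|(W_N)_{ij}| \ge \epsilon N^{1/4}\}} \right). \tag{1.3} \]

Here and throughout the paper, $\mathbb{E}\xi$ denotes the mathematical expectation and $\mathbb{V}\xi$ 
the variance of a random variable $\xi$. In addition, we assume that the diagonal entries 
satisfy 

\[ \mathbb{E}[(W_N)_{ii}] = 0, \quad 1 \le i \le N, \quad \sigma_1^2 := \sup_{i,\, N} \mathbb{E}[(W_N)_{ii}^2] < \infty, \tag{1.4} \]

\[ l_N(\epsilon) \to 0 \ \text{ as } N \to \infty, \quad \forall \epsilon > 0, \quad \text{where} \tag{1.5} \]

\[ l_N(\epsilon) = \frac{1}{N} \sum_{1 \le i \le N} \mathbb{E}\left( |(W_N)_{ii}|^2 \, \mathbf{1}_{\{|(W_N)_{ii}| \ge \epsilon \sqrt{N}\}} \right). \tag{1.6} \]

We note that (1.2) and (1.5) are satisfied if there exists an $\epsilon > 0$ such that 

\[ \sup_{j \ne k,\, N} \mathbb{E}|(W_N)_{jk}|^{4+\epsilon} < \infty, \quad \sup_{i,\, N} \mathbb{E}|(W_N)_{ii}|^{2+\epsilon} < \infty. \tag{1.7} \]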

If the off-diagonal elements $(W_N)_{jk}$ are identically distributed Gaussian and the 
diagonal elements $(W_N)_{ii}$ are also identically distributed Gaussian with twice the 
variance of the off-diagonal elements then $W_N$ is said to belong to the Gaussian 
Orthogonal Ensemble (GOE). 

In the Hermitian case, we assume that the entries 

\[ \operatorname{Re}(W_N)_{jk}, \ \operatorname{Im}(W_N)_{jk}, \ 1 \le j < k \le N, \quad (W_N)_{ii}, \ 1 \le i \le N, \]

are independent random variables such that the off-diagonal entries satisfy 

\[ \mathbb{E}\operatorname{Re}(W_N)_{jk} = \mathbb{E}\operatorname{Im}(W_N)_{jk} = 0, \quad 1 \le j < k \le N, \tag{1.8} \]

\[ \mathbb{V}\operatorname{Re}(W_N)_{jk} = \mathbb{V}\operatorname{Im}(W_N)_{jk} = \frac{\sigma^2}{2}, \quad 1 \le j < k \le N, \quad m_4 := \sup_{j \ne k,\, N} \mathbb{E}|(W_N)_{jk}|^4 < \infty, \tag{1.9} \]

and the Lindeberg type condition (1.2) for the fourth moments of the off-diagonal 
entries takes place. In addition, we assume that the diagonal entries satisfy 

\[ \mathbb{E}(W_N)_{ii} = 0, \quad 1 \le i \le N, \quad \sigma_1^2 := \sup_{i,\, N} \mathbb{E}|(W_N)_{ii}|^2 < \infty, \tag{1.10} \]

and the Lindeberg type condition (1.5) for the second moments of the diagonal 
entries takes place. 

If the real and imaginary parts of the off-diagonal elements $(W_N)_{jk}$ are independent 
identically distributed Gaussian random variables and the diagonal elements 
$(W_N)_{ii}$ are also identically distributed Gaussian random variables with twice the 
variance of the real part of the off-diagonal entries then $W_N$ is said to belong to 
the Gaussian Unitary Ensemble (GUE). 

We refer the reader to [1], [2], [5], and [33] for basic results about standard real 
symmetric and Hermitian Wigner matrices. In particular, the Wigner semicircle law 
states that the empirical distribution of the eigenvalues of $X_N = \frac{1}{\sqrt{N}} W_N$ converges 
as $N \to \infty$ to the nonrandom limiting probability distribution $\mu_{sc}$, known as the 
semicircle distribution, whose density with respect to the Lebesgue measure is given 
by 

\[ \frac{d\mu_{sc}}{dx}(x) := \frac{1}{2\pi\sigma^2} \sqrt{4\sigma^2 - x^2} \, \mathbf{1}_{[-2\sigma, 2\sigma]}(x). \tag{1.11} \]

The Stieltjes transform of the semicircle distribution, 

\[ g_\sigma(z) := \int \frac{d\mu_{sc}(x)}{z - x} = \frac{z - \sqrt{z^2 - 4\sigma^2}}{2\sigma^2}, \quad z \in \mathbb{C} \setminus [-2\sigma, 2\sigma], \tag{1.12} \]

is the solution to 

\[ \sigma^2 g_\sigma^2(z) - z g_\sigma(z) + 1 = 0 \tag{1.13} \]

that decays to 0 as $|z| \to \infty$. 
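
As a numerical sanity check of (1.12) and (1.13), the following sketch evaluates the closed form of the Stieltjes transform, compares it with a direct quadrature of the semicircle density, and computes the residual of the quadratic equation. The function names and the way the branch of the square root is selected (a product of two principal square roots, which behaves like $z$ at infinity) are illustrative choices and are not taken from the paper.

```python
import numpy as np

def stieltjes_semicircle(z, sigma=1.0):
    """Closed form (1.12) of g_sigma(z) for z outside [-2*sigma, 2*sigma].

    sqrt(z^2 - 4 sigma^2) is written as sqrt(z - 2s) * sqrt(z + 2s) with
    principal branches; this product is analytic off the cut and ~ z at
    infinity, which selects the solution of (1.13) decaying like 1/z.
    """
    z = complex(z)
    root = np.sqrt(z - 2.0 * sigma) * np.sqrt(z + 2.0 * sigma)
    return (z - root) / (2.0 * sigma**2)

def stieltjes_by_quadrature(z, sigma=1.0, n=200_000):
    """Midpoint-rule approximation of the integral of d mu_sc(x) / (z - x)."""
    h = 4.0 * sigma / n
    x = -2.0 * sigma + h * (np.arange(n) + 0.5)
    density = np.sqrt(4.0 * sigma**2 - x**2) / (2.0 * np.pi * sigma**2)
    return np.sum(density / (z - x)) * h

def quadratic_residual(z, sigma=1.0):
    """Left-hand side of (1.13); should vanish up to rounding."""
    g = stieltjes_semicircle(z, sigma)
    return sigma**2 * g**2 - z * g + 1.0
```

For instance, `stieltjes_semicircle(2.5, 1.0)` equals $1/2$, which is the value $1/\theta$ at $z = \theta + \sigma^2/\theta$ with $\theta = 2$, $\sigma = 1$.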

We consider the spectrum of $\frac{1}{\sqrt{N}} W_N + A_N = X_N + A_N$, where $A_N$ is a 
deterministic real symmetric (Hermitian) matrix of fixed finite rank $r$. Spectral 
properties of finite rank perturbations of Wigner matrices have been studied 
extensively since the pioneering paper [25] by Füredi and Komlós, who considered 
$(A_N)_{ij} = \frac{c}{\sqrt{N}}$, $1 \le i, j \le N$, which corresponds to the case of a Wigner matrix 
with non-centered entries having mathematical expectation $c$, where $c$ is a fixed 
non-zero real number. It was shown that the largest eigenvalue of $X_N + A_N$ is 
asymptotically normal with mathematical expectation $c\sqrt{N} + \frac{\sigma^2}{c\sqrt{N}}$ and variance 
$\frac{2\sigma^2}{N}$ in the real symmetric case and $\frac{\sigma^2}{N}$ in the Hermitian case. 
The more mathematically challenging case when $c = \frac{\theta}{\sqrt{N}}$ was studied in [32] (in 
the GUE case) and [23] (for arbitrary Hermitian Wigner matrices with symmetric 
sub-Gaussian marginal distribution). In particular, it was shown that there is a 
phase transition at $\theta = \sigma$. For $\theta > \sigma$ the spectrum of $X_N + A_N$ has one outlier 
which is asymptotically normal with mathematical expectation $\rho = \theta + \frac{\sigma^2}{\theta}$ and 
variance $\frac{\sigma^2}{N}\left(1 - \frac{\sigma^2}{\theta^2}\right)$. For $0 < \theta < \sigma$, the largest eigenvalue of $X_N + A_N$ fluctuates on 
the scale $N^{-2/3}$ around $2\sigma$, and has Tracy-Widom distribution in the limit. Large 
deviations for the outlier in the Gaussian case were studied by Maida in [55]. 
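
The phase transition at $\theta = \sigma$ can be observed in a small simulation. The sketch below is only an illustration under stated assumptions: it uses a Gaussian (GOE-type) Wigner matrix with $\sigma = 1$, the delocalized rank-one perturbation $A_N = \theta\, u u^T$ with $u = (1, \dots, 1)/\sqrt{N}$ (so that $(A_N)_{ij} = \theta/N$), and a fixed random seed; the function names are hypothetical.

```python
import numpy as np

def goe_like_wigner(n, sigma, rng):
    """Real symmetric W_N with off-diagonal variance sigma^2 and diagonal variance 2*sigma^2."""
    a = rng.standard_normal((n, n))
    return sigma * (a + a.T) / np.sqrt(2.0)

def top_eigenvalue_of_deformation(n, theta, sigma=1.0, seed=0):
    """Largest eigenvalue of X_N + A_N, where A_N = theta * u u^T and u = (1,...,1)/sqrt(n)."""
    rng = np.random.default_rng(seed)
    x = goe_like_wigner(n, sigma, rng) / np.sqrt(n)
    u = np.ones(n) / np.sqrt(n)
    a = theta * np.outer(u, u)
    return np.linalg.eigvalsh(x + a)[-1]  # eigvalsh returns eigenvalues in ascending order

def predicted_limit(theta, sigma=1.0):
    """rho = theta + sigma^2/theta above the threshold, otherwise the spectral edge 2*sigma."""
    return theta + sigma**2 / theta if theta > sigma else 2.0 * sigma
```

For moderate $N$ the value returned for $\theta = 2$ should lie near $\rho = 2.5$, while for $\theta = 0.5$ the largest eigenvalue should stay near the edge $2\sigma = 2$.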

The case of arbitrary finite rank perturbations has been considered by Péché 
([42]) for a GUE matrix and by several other authors for real symmetric and 
Hermitian Wigner random matrices (see e.g. [17], [18], [19], [10], [11], [12], [33], [32] 
and references therein). We also refer the reader to [7], [8], [14], and [15] for 
related results for unitary and orthogonal ensembles of random matrices. Finally, we 
note several results about the outliers in the spectrum of spiked sample covariance 
random matrices ([30], [5], [41], [6]) and non-Hermitian random matrices ([50]). 

Let us denote the ordered eigenvalues of $A_N$ by $\theta_1 > \dots > \theta_J$. The multiplicity 
of $\theta_j$ is fixed and denoted by $k_j$, $1 \le j \le J$. We assume that both the eigenvalues 
of $A_N$ and their multiplicities are independent of $N$. Let $j_0$ be such that $\theta_{j_0} = 0$. 
Thus, $A_N$ has $j_0 - 1$ distinct positive eigenvalues (not counting multiplicities). The 
ordered eigenvalues of $X_N + A_N$ are denoted $\lambda_1 \ge \dots \ge \lambda_N$. Our first theorem is: 

Theorem 1.1 ([17], [44]). Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) 
Wigner matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)). Let $A_N$ be a 
deterministic real symmetric (Hermitian) matrix of fixed finite rank $r$ as above. Let 
$J_{+\sigma}$ (resp. $J_{-\sigma}$) be the number of $j$'s such that $\theta_j > \sigma$ (resp. $\theta_j < -\sigma$) and let 

\[ \rho_{\theta_j} = \rho_j := \theta_j + \frac{\sigma^2}{\theta_j}. \tag{1.14} \]

Then the following holds: 

(a) For $1 \le j \le J_{+\sigma}$, $1 \le i \le k_j$, $\lambda_{k_1 + \dots + k_{j-1} + i} \to \rho_j$, 

(b) $\lambda_{k_1 + \dots + k_{J_{+\sigma}} + 1} \to 2\sigma$, 

(c) $\lambda_{k_1 + \dots + k_{J - J_{-\sigma}}} \to -2\sigma$, 

(d) For $j \ge J - J_{-\sigma} + 1$, $1 \le i \le k_j$, $\lambda_{k_1 + \dots + k_{j-1} + i} \to \rho_j$. 
The convergence in (a)-(d) is in probability. 

In [17], Capitaine, Donati-Martin, and Féral consider Wigner matrices with i.i.d. 
entries whose marginal distribution is symmetric and 
satisfies a Poincaré inequality (1.15). Under these conditions they show that the 
convergence in Theorem 1.1 takes place almost surely. We recall that a probability 
measure $P$ on $\mathbb{R}^M$ satisfies a Poincaré inequality with constant $\upsilon > 0$ if, for all 
continuously differentiable functions $f : \mathbb{R}^M \to \mathbb{C}$, 

\[ \mathbb{V}_P(f) = \mathbb{E}_P\left( |f(x) - \mathbb{E}_P(f(x))|^2 \right) \le \frac{1}{\upsilon} \mathbb{E}_P\left[ |\nabla f(x)|^2 \right]. \tag{1.15} \]

It is known that a probability distribution satisfying the Poincaré inequality (1.15) 
has a subexponential tail (see e.g. [1]). 

In [44], we consider Wigner matrices with five finite moments for the off-diagonal 
entries and three finite moments for the diagonal entries. Using the standard 
truncation argument (see the appendix of this paper) these conditions can be weakened 
to those in (1.1)-(1.6) (respectively (1.8)-(1.10)). In this paper we will cite theorems 
from [44] assuming the conditions (1.1)-(1.6) (respectively (1.8)-(1.10)) without further 
comment. 

The fluctuations of the outlying eigenvalues around $\rho_{\theta_j}$ have been studied in [18], 
[44], and [32]. These fluctuations are dependent on the form of the perturbation. 
In particular, additional assumptions on the eigenvectors of $A_N$ are required for a 
distributional limit to exist and the limiting distribution of the (properly rescaled) 
outliers depends on the localized/delocalized nature of the eigenvectors of $A_N$ 
corresponding to $\theta_j$. 

In [18], the authors consider two regimes: 

Case A ("The eigenvectors don't spread out") 

The orthonormal eigenvectors of $A_N$ corresponding to $\theta_j$ are spanned by a finite 
number $K_j$ of canonical basis vectors of $\mathbb{C}^N$ (without loss of generality we can 
assume those canonical vectors to be $e_1, \dots, e_{K_j}$), and the (non-zero) coordinates 
of these eigenvectors are independent of $N$ for all sufficiently large $N$. 

Case B ("The eigenvectors are delocalized") 

The $\ell^\infty$ norm of every orthonormal eigenvector of $A_N$ corresponding to $\theta_j$ goes 
to zero as $N \to \infty$. 

We denote by $k_{+\sigma} := k_1 + \dots + k_{J_{+\sigma}}$ the number of positive eigenvalues of $A_N$ 
bigger than $\sigma$ (counting with multiplicities) and by $k \ge k_{+\sigma}$ the minimal number 
of canonical basis vectors $e_1, \dots, e_N$ of $\mathbb{C}^N$ required to span all the eigenvectors 
corresponding to the eigenvalues $\theta_1, \dots, \theta_{J_{+\sigma}}$. 

Let us denote 

\[ c_{\theta_j} := \frac{\theta_j^2}{\theta_j^2 - \sigma^2}. \tag{1.16} \]

The following theorem concerning fluctuations in Case A was proved for symmetric 
marginal distribution satisfying the Poincaré inequality in [18]. It was extended 
to the assumptions (1.1)-(1.6) (respectively (1.8)-(1.10)) in [44]. 

Theorem 1.2 (Theorem 1.3 in [44]). Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric 
(Hermitian) Wigner matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)). In 
Case A, the $k_j$-dimensional vector 

\[ \left( c_{\theta_j} \sqrt{N} (\lambda_{k_1 + \dots + k_{j-1} + i} - \rho_j), \ i = 1, \dots, k_j \right) \]

converges in distribution to the distribution of the ordered eigenvalues of the $k_j \times k_j$ 
random matrix $V_j$ defined as 

\[ V_j := U_j^* (W_j + H_j) U_j, \tag{1.17} \]

where $W_j$ is a Wigner random matrix of size $K_j$ with the same marginal distribution 
of the matrix entries as $W_N$, $H_j$ is a centered Hermitian Gaussian matrix of 
size $K_j$, independent of $W_j$, with independent entries $H_{st}$, $1 \le s \le t \le K_j$, with 
the variance of the entries given by 

mi)=( m4 ~t^' u l+^,« = i ^ (i-is) 

E(\H st \ 2 ) = ——», \<s<t<K 3 , (1.19) 
and $U_j$ is a $K_j \times k_j$ matrix such that the ($K_j$-dimensional) columns of $U_j$ are written 
from the first $K_j$ coordinates of the orthonormal eigenvectors corresponding to $\theta_j$. 

The next theorem deals with Case B and is this paper's main result. Before 
stating the theorem we define the matrix of third moments on the off-diagonal by 

\[ (M_3)_{ij} := \mu_{3,ij} (1 - \delta_{ij}), \tag{1.20} \]

where $\mu_{3,ij} := \mathbb{E}[|W_{ij}|^2 W_{ij}]$. 

Theorem 1.3. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) Wigner 
matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)). Let $u_N^1, \dots, u_N^{k_j}$ be a set of 
orthogonal eigenvectors of $A_N$ with eigenvalue $\theta_j$. In Case B, the difference between 
the $k_j$-dimensional vector 

\[ \left( c_{\theta_j} \sqrt{N} (\lambda_{k_1 + \dots + k_{j-1} + i} - \rho_j), \ i = 1, \dots, k_j \right) \]

and the vector formed by the (ordered) eigenvalues of a $k_j \times k_j$ GOE (GUE) matrix 
with the variance of the matrix entries given by $\frac{\theta_j^2 \sigma^2}{\theta_j^2 - \sigma^2}$ plus a deterministic matrix 
with $lp$ entry ($1 \le l, p \le k_j$) given by $\frac{1}{\theta_j^2 N} (u_N^l)^* M_3 u_N^p$ converges to zero in 
probability. 

The proof of Theorem 1.3 is in Section 4. 

Theorem 1.3 was originally proved in [18] under additional technical assumptions 
that the entries are i.i.d., their marginal distribution is symmetric and satisfies the 
Poincaré inequality, and $k = o(\sqrt{N})$, where we recall that $k$ is the minimal number 
of canonical basis vectors $e_1, \dots, e_N$ of $\mathbb{C}^N$ required to span all the eigenvectors 
corresponding to the eigenvalues bigger than $\sigma$. In [35], Knowles and Yin prove 
Theorem 1.3 provided $\theta_j$ is a simple eigenvalue. It should be noted that Knowles 
and Yin allow the non-zero eigenvalues of $A_N$ to depend on $N$ as long as 

110,1 - 2<j\ > <P Cl Y- 1 / 3 , min|0j - 0<| > ^Y" 1 / 2 ^! - 2a!)- 1 / 2 , 

where $\varphi := (\log N)^{\log\log N}$, and $C_1$ is a positive constant. In their approach, 
Knowles and Yin need an additional technical assumption, namely they require 
that the marginal distributions of the entries of $W_N$ are uniformly sub-Gaussian, 
in the sense that 

\[ \mathbb{P}(|(W_N)_{ij}| \ge x) \le \vartheta^{-1} \exp(-x^{\vartheta}) \]
for some $\vartheta > 0$. After this paper was posted online, Knowles and Yin extended 
their results to arbitrary deterministic matrices with bounded norm and fixed rank 
in [31]. 

In addition to the results mentioned above, Knowles and Yin prove the universality 
of the limit distribution of the largest "sticking" eigenvalues of $X_N + A_N$, 
i.e. the eigenvalues that correspond to $|\theta_j| < \sigma$ (see Theorem 2.7 in [32]). In 
other words, they prove that the limit distribution of the 1st, 2nd, 3rd, etc. largest 
"sticking" eigenvalues of $X_N + A_N$ (i.e. $\lambda_{k_{+\sigma}+1}$, $\lambda_{k_{+\sigma}+2}$, $\lambda_{k_{+\sigma}+3}$, etc.) coincides with 
the limit distribution of the 1st, 2nd, 3rd, etc. largest eigenvalues of $X_N$. We recall 
that $k_{+\sigma}$ denotes the number of the eigenvalues of $A_N$ greater than $\sigma$ (counting 
with multiplicities). In the Gaussian case, the limiting distribution of the largest 
eigenvalues of a GUE (GOE) random matrix was first studied by Tracy and Widom 
in [53] and [54]. It is now known as the Tracy-Widom distribution. For the results 
about local universality in non-perturbed Wigner matrices we refer the reader to 
[29], [21], [23], [52], [51], [33], [48], and references therein. In the case of random 
$A_N$ the universality of the distribution of "sticking" eigenvalues was proved by 
Benaych-Georges, Guionnet, and Maida in [11]. 

The first step in proving Theorem 1.3 is to use Proposition 1.1 to associate the 
fluctuations of the eigenvalues outside of the support of the semicircle law with 
quadratic forms of the resolvent of the unperturbed Wigner matrix, given by 

\[ R_N(z) := (z I_N - X_N)^{-1}. \]
Let us denote by 

\[ \langle u, v \rangle := \sum_{i=1}^{N} \bar{u}_i v_i \tag{1.21} \]

the standard scalar product in $\mathbb{C}^N$ and by 

\[ \|u\| = \sqrt{\langle u, u \rangle} \]
the induced norm on $\mathbb{C}^N$. 

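Proposition 1.1 (Proposition 1.1 in [44]). Let $\theta_j$ be an eigenvalue of $A_N$ that is 
greater in magnitude than $\sigma$ and let $u_N^1, \dots, u_N^{k_j}$ be an orthonormal set of 
eigenvectors of $A_N$ associated with $\theta_j$. Let $\Xi_N^j$ be the $k_j \times k_j$ matrix with entries 

\[ (\Xi_N^j)_{lp} := \sqrt{N} \left( \langle u_N^l, R_N(\rho_j) u_N^p \rangle - g_\sigma(\rho_j) \delta_{lp} \right) = \sqrt{N} \left( \langle u_N^l, R_N(\rho_j) u_N^p \rangle - \frac{1}{\theta_j} \delta_{lp} \right). \tag{1.22} \]

Let $y_1 \ge \dots \ge y_{k_j}$ be the ordered eigenvalues of the matrix $\Xi_N^j$. Then 

\[ \sqrt{N} (\lambda_{k_1 + \dots + k_{j-1} + i} - \rho_j) + \frac{1}{g_\sigma'(\rho_j)} y_i \to 0, \quad i = 1, \dots, k_j, \tag{1.23} \]

in probability. 

Remark 1.1. A simple computation gives 

\[ g_\sigma'(\rho_j) = -\frac{1}{\theta_j^2 - \sigma^2}. \tag{1.24} \]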
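
The two facts used around Proposition 1.1 can be checked numerically. Differentiating (1.13) gives $g_\sigma'(z) = g_\sigma(z) / (2\sigma^2 g_\sigma(z) - z)$; at $z = \rho_j = \theta_j + \sigma^2/\theta_j$ with $\theta_j > \sigma$ one has $g_\sigma(\rho_j) = 1/\theta_j$ and therefore $1/g_\sigma'(\rho_j) = \sigma^2 - \theta_j^2$. The sketch below, with hypothetical helper names and a finite-difference derivative chosen only for illustration, confirms both values.

```python
import numpy as np

def g_sigma(z, sigma):
    """Stieltjes transform (1.12) of the semicircle law, branch decaying like 1/z."""
    z = complex(z)
    return (z - np.sqrt(z - 2 * sigma) * np.sqrt(z + 2 * sigma)) / (2 * sigma**2)

def g_sigma_prime(z, sigma, h=1e-5):
    """Central finite difference; adequate when z is away from the edge 2*sigma."""
    return (g_sigma(z + h, sigma) - g_sigma(z - h, sigma)) / (2 * h)

def check_outlier_identities(theta, sigma):
    """Return (g_sigma(rho), 1/g_sigma'(rho)) at rho = theta + sigma^2/theta."""
    rho = theta + sigma**2 / theta
    return g_sigma(rho, sigma).real, (1.0 / g_sigma_prime(rho, sigma)).real
```

With $\theta = 2$ and $\sigma = 1$ the pair returned should be close to $(1/2, -3)$.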

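In the second step of proving Theorem 1.3 we truncate and remove the diagonal 
terms, defining 

\[ (\hat{W}_N)_{ij} := (W_N)_{ij} (1 - \delta_{ij}) \mathbf{1}_{\{|(W_N)_{ij}| \le N^{1/4}\}} - \mathbb{E}\left[ (W_N)_{ij} (1 - \delta_{ij}) \mathbf{1}_{\{|(W_N)_{ij}| \le N^{1/4}\}} \right], \tag{1.25} \]

$\hat{X}_N := \frac{1}{\sqrt{N}} \hat{W}_N$, and $\hat{R}_N(z) := (z I_N - \hat{X}_N)^{-1}$ for $z \in \mathbb{C} \setminus \mathbb{R}$. Our goal is to compute 
the centering, $\mathbb{E}\langle u_N^p, \hat{R}_N(z) u_N^l \rangle$, which in general depends on the form of the 
vectors. Define 

\[ (\hat{M}_3)_{ij} := \hat{\mu}_{3,ij} (1 - \delta_{ij}), \tag{1.26} \]

where $\hat{\mu}_{3,ij} := \mathbb{E}[|\hat{W}_{ij}|^2 \hat{W}_{ij}]$. 

In the appendix (Section 5.1) we show that the difference 

\[ \sqrt{N} \left( \langle u_N, R_N(z) v_N \rangle - \langle u_N, \hat{R}_N(z) v_N \rangle \right) \]

goes to zero in probability. It is therefore sufficient to use $\mathbb{E}[\langle u_N, \hat{R}_N(z) v_N \rangle]$ as the 
centering for $\langle u_N, R_N(z) v_N \rangle$ when proving distributional convergence (see proof of 
Theorem 1.3). 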
Theorem 1.4. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) Wigner 
matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)) and let $\hat{X}_N$, $\hat{R}_N$ be the matrices 
defined after (1.25). Let $u_N$, $v_N$ be $N$-dimensional unit vectors. Then 

E(u w , R N (z)v N ) - g„(z){u N , v N )} = l g 4 ( z )(u N , M 3 v N ) + O [ fi»(lM^)(N + 1 ) 

(1.27) 

uniformly on $\mathbb{C} \setminus \mathbb{R}$, where $P_{12}$ is a polynomial of degree 12 with fixed positive 
coefficients. 

Remark 1.2. Since $\|\hat{M}_3\| = O(N)$, we have $\frac{1}{N} \langle u_N, \hat{M}_3 v_N \rangle = O(1)$. Furthermore, 
if $\|u_N\|_1$ or $\|v_N\|_1$ is $o(\sqrt{N})$ then $\frac{1}{N} \langle u_N, \hat{M}_3 v_N \rangle = o(1)$. In addition, since 
$\hat{\mu}_{3,ij} - \mu_{3,ij} = O(N^{-1/4})$ uniformly in $i, j$, it follows that 

\[ \frac{1}{N} \langle u_N, \hat{M}_3 v_N \rangle - \frac{1}{N} \langle u_N, M_3 v_N \rangle = O(N^{-1/4}). \tag{1.28} \]

The proof of Theorem 1.4 is in Section 2. 

In the final step we compute the joint asymptotic distribution of the matrix 
entries of $\Xi_N^j$. Before stating the theorem we introduce the notation 

\[ \Pi(z_1, z_2) := -g_\sigma(z_1) g_\sigma(z_2) + \frac{g_\sigma(z_1) g_\sigma(z_2)}{1 - \sigma^2 g_\sigma(z_1) g_\sigma(z_2)}. \tag{1.29} \]

Theorem 1.5. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) Wigner 
matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)). Let $u_N^1, \dots, u_N^m$ be a sequence 
of mutually orthogonal $N$-dimensional unit vectors such that $\|u_N^l\|_\infty \to 0$ for 
$l = 1, \dots, m$ as $N \to \infty$. Let $G_N(z)$ be the $m \times m$ matrix defined by: 

\[ (G_N(z))_{lp} := \sqrt{N} \left( \langle u_N^l, R_N(z) u_N^p \rangle - \mathbb{E}\langle u_N^l, R_N(z) u_N^p \rangle \right). \tag{1.30} \]

The matrix-valued function $G_N(z)$ converges in finite dimensional distributions to 
the $m \times m$ matrix-valued random field $\Gamma(z)$ with independent, centered, Gaussian 
entries with covariance given by: 

E[Re(r lp (z 1 ))Re{r lp {z 2 ))}^^(U(z 1 ,z 2 )+U(z 1 ,z 2 )) + ^(U(z 1 ,z 2 )+U(z 1 ,z 2 )) 

+ ^(p(n(z 1 ,z 2 ) + n(z 1 ,z 2 )) + u(z 1 ,z 2 ) + n(-z 1 ,j 2 )) 

E[Re(r /p ( 2l ))Im(rVz 2 ))] = ^(H(«i, z 2 ) - H(ti, I2)) + j(-n(z x , z 2 ) + H(?i, z 2 )) 

+ -£(p(-n(z 1 ,z 2 )+Ii(zi,z 2 ))+R.(zi,Z2)-n(zi,Z2)) 

E[im(r /p (2i))im(r / j,(z2))] = ^(n(z 1 ,z 2 ) + n(zi,z 2 )) + -(-n(z 1 ,z 2 )-n(zi,z 2 )) 

+ Z 7^(p(-n(2i,z 2 )-n(z 1 ,z 2 )) + n(z 1 ,z 2 ) + n(z 1 ,z 2 )) 



for $l < p$ and $\Gamma_{lp}(z) = \Gamma_{pl}(z)$ for $l > p$, where $\rho = 1$ if $W_N$ is real symmetric and 
$\rho = 0$ if $W_N$ is Hermitian. 

Remark 1.3. The result of the above theorem about Gaussian fluctuation of $(G_N(z))_{lp}$ 
can be extended to the case when either $\|u_N^l\|_\infty \to 0$ or $\|u_N^p\|_\infty \to 0$. See Lemma 



For the recent results on the fluctuations of the matrix entries of regular functions 
of Wigner matrices, we refer the reader to [35], [43], [40], [36], [3], and [37]. Theorem 
1.5 was proved in [3] in the one vector case, assuming that the third moment of the 
matrix entries vanishes. We outline the proof of Theorem 1.5 in Section 3, showing 
that the arguments of [5] can be adapted. 

Finally, we extend the result of Theorem 1.5 to a sufficiently large class of regular 
test functions. Consider the space $\mathcal{H}_s$ consisting of the functions $f : \mathbb{R} \to \mathbb{R}$ that 
satisfy 

\[ \|f\|_s^2 := \int (1 + |k|)^{2s} |\hat{f}(k)|^2 \, dk < \infty, \tag{1.31} \]

where $\hat{f}(k)$ is the Fourier transform 

\[ \hat{f}(k) = \frac{1}{\sqrt{2\pi}} \int e^{-ikx} f(x) \, dx. \]



Theorem 1.6. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) Wigner 
matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)). Let $u_N^1, \dots, u_N^m$ be a sequence 
of mutually orthogonal $N$-dimensional unit vectors such that $\|u_N^l\|_\infty \to 0$ for 
$l = 1, \dots, m$ as $N \to \infty$. Finally, let $f : \mathbb{R} \to \mathbb{R}$ belong to $\mathcal{H}_s$ for some $s > 4$. 
Denote 

\[ Y_{N,lp}(f) := \sqrt{N} \left( \langle u_N^l, f(X_N) u_N^p \rangle - \mathbb{E}\langle u_N^l, f(X_N) u_N^p \rangle \right), \quad 1 \le l, p \le m. \tag{1.32} \]

Then the joint distribution of $\{Y_{N,lp}, \ 1 \le l \le p \le m\}$ converges as $N \to \infty$ to 
the distribution of independent centered normal random variables with the variance 

^T T / 2CT (/(*) - /(l/)) 2 2-5-1 V 4 * 3 " ^V^ 2 - l^rfarrfl/, (1-33) 

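where $\beta = 1$ in the real symmetric case and $\beta = 2$ in the Hermitian case. 

In addition, if $f$ has a sufficiently large number of derivatives (13 is enough) and 
compact support, 

\[ \mathbb{E}\langle u_N^l, f(\hat{X}_N) u_N^p \rangle = \delta_{lp} \int_{-2\sigma}^{2\sigma} f(x) \frac{1}{2\pi\sigma^2} \sqrt{4\sigma^2 - x^2} \, dx \tag{1.34} \]

\[ \qquad + N^{-3/2} \langle u_N^l, \hat{M}_3 u_N^p \rangle \int_{-2\sigma}^{2\sigma} f(x) \left( -2x\sigma^{-4} + x^3\sigma^{-6} \right) \frac{1}{2\pi\sigma^2} \sqrt{4\sigma^2 - x^2} \, dx + O(N^{-1}). \tag{1.35} \]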
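
A weight of the form $x^3\sigma^{-6} - 2x\sigma^{-4}$ against the semicircle density is what one expects from the $g_\sigma^4(z)$ factor in the resolvent expansion, since the Stieltjes-type integral of this weight reproduces $g_\sigma^4(z)$ (a rescaled Chebyshev polynomial of the second kind of degree 3). The following sketch checks this identity numerically. It is our own sanity check rather than an argument from the paper, and the function names and the quadrature rule are illustrative assumptions.

```python
import numpy as np

def g_sigma(z, sigma):
    """Stieltjes transform (1.12) of the semicircle law, branch decaying like 1/z."""
    z = complex(z)
    return (z - np.sqrt(z - 2 * sigma) * np.sqrt(z + 2 * sigma)) / (2 * sigma**2)

def weighted_stieltjes(z, sigma, n=200_000):
    """Midpoint rule for the integral of (x^3/sigma^6 - 2x/sigma^4) / (z - x) d mu_sc(x)."""
    h = 4.0 * sigma / n
    x = -2.0 * sigma + h * (np.arange(n) + 0.5)
    density = np.sqrt(4.0 * sigma**2 - x**2) / (2.0 * np.pi * sigma**2)
    weight = x**3 / sigma**6 - 2.0 * x / sigma**4
    return np.sum(weight * density / (z - x)) * h

def identity_gap(z, sigma):
    """Absolute difference between the weighted integral and g_sigma(z)^4."""
    return abs(weighted_stieltjes(z, sigma) - g_sigma(z, sigma) ** 4)
```

The gap should be at the level of the quadrature error for any $z$ off the real axis, for example $z = 0.7 + 0.9i$ with $\sigma = 1.3$.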



Remark 1.4. One can extend (1.34)-(1.35) to $f$ satisfying $\|f\|_{13,1,+} < \infty$, where 

\[ \|f\|_{n,1,+} := \max\left( \int_{-\infty}^{+\infty} (|x| + 1) \left| \frac{d^l f}{dx^l}(x) \right| dx, \ 0 \le l \le n \right). \tag{1.36} \]



Remark 1.5. Under the additional strong technical assumptions that the matrix 
entries ofW^ o-re i.i.d. and logEexp(i(Wjv)n) and Eexp(t|(Wjv)n|) are entire 
functions, the first part of the result of Theorem \1.6\ follows from Theorem 5. 1 in 
In Section 2 we compute the centering for $\langle u_N, R_N(z) v_N \rangle = u_N^* R_N(z) v_N$. The 
fluctuation results (Theorems 1.5 and 1.6) are established in Section 3. In Section 
4 these results are put together to prove Theorem 1.3. Finally, Section 5 contains 
technical arguments for Section 2 as well as reductions on the matrix ensemble under 
consideration. 

2. Expectation of quadratic forms of the resolvent 

In this section, we prove Theorem 1.4. We will suppress dependence on $N$ and 
$z$ when possible. 

We begin by refining previous estimates on the expectation of the entries of the 
resolvent. In [43] and [40], we proved the following result. 

Proposition 2.1 (Proposition 3.1 in [43]). Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real 
symmetric (Hermitian) Wigner matrix and $R_N(z) = (z I_N - X_N)^{-1}$ where $z \in \mathbb{C} \setminus \mathbb{R}$. 
Then 

ER ii (z)=g a (z) + 0(^^ N y (2.1) 

ERi > {Z) = ° {\^zJn) '^^3<N, (2.2) 

v^ W = o( I ^) ,!<<,;<*, (2.3) 

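uniformly on bounded subsets of $\mathbb{C} \setminus \mathbb{R}$. 
In addition, if 

\[ \sup_{i \ne j,\, N} \mathbb{E}[|(W_N)_{ij}|^5] < \infty, \quad \sup_{i,\, N} \mathbb{E}[|(W_N)_{ii}|^3] < \infty, \]

then 

\[ \mathbb{E} R_{ij}(z) = O\left( \frac{1}{|\operatorname{Im} z|^9 N^{3/2}} \right), \quad 1 \le i \ne j \le N, \tag{2.4} \]
uniformly on bounded subsets of $\mathbb{C} \setminus \mathbb{R}$. 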

Using Lemma EH] and Section 8.3 of [3] we set the diagonal entries of Wat to 
0 and truncate the off-diagonal elements at $N^{1/4} \epsilon_N$ for some $\epsilon_N \to 0$ as $N \to \infty$ 
without changing the limiting distribution of $\sqrt{N} u_N^* R_N(z) v_N$. We note that [3] 
concerns the i.i.d. case but our assumption (1.3) is the averaged version of (8.18) 
from [3] and is sufficient for the proof. We also note that the truncation will change 
the second and third moments of the entries of $W_N$ but this error is negligible 
and will not be mentioned again. The estimate (2.4) is improved in the following 
proposition: 

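Proposition 2.2. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) 
Wigner matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.10)), and let 
$(\hat{W}_N)_{ij} = (W_N)_{ij} (1 - \delta_{ij}) \mathbf{1}_{\{|(W_N)_{ij}| \le N^{1/4}\}} - \mathbb{E}\left[ (W_N)_{ij} (1 - \delta_{ij}) \mathbf{1}_{\{|(W_N)_{ij}| \le N^{1/4}\}} \right]$ 
and $\hat{R}_N(z) = (z I_N - \hat{X}_N)^{-1}$ 
where $z \in \mathbb{C} \setminus \mathbb{R}$. Then for $i \ne j$: 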

N 3/2y^ z > + u y N 2 

uniformly on bounded subsets of $\mathbb{C} \setminus \mathbb{R}$, where $\hat{\mu}_{3,ij} = \mathbb{E}[|\hat{W}_{ij}|^2 \hat{W}_{ij}]$. 



E %W = ° ( FT— W^ui. ) > 1 < *¥>3 < N, (2.4) 



[%(-)] - pw*) + o M N r ' ) ( 2 - 5 ) 
Equation (2.5) establishes Theorem 1.4 by computing $\mathbb{E}[u_N^* \hat{R}_N(z) v_N]$ for unit 
vectors $u_N$ and $v_N$. 

The proof in the real case is below; the complex case follows similarly. 

Proof. For notational convenience we set $X_N = \hat{X}_N$ and $R_N(z) = \hat{R}_N(z)$. The 
proof of this proposition is similar to the proof of equation (2.4) in [43] but also uses 
the estimates established in Proposition 2.1 of [43] to refine the error terms. The 
following estimates will be useful: 



In dealing with resolvents, we will use the resolvent identity 

\[ (zI - A_2)^{-1} = (zI - A_1)^{-1} - (zI - A_1)^{-1} (A_1 - A_2) (zI - A_2)^{-1}, \tag{2.7} \]

which holds for all $z \in \mathbb{C}$ where $(zI - A_1)$ and $(zI - A_2)$ are invertible. 
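
The resolvent identity follows from $R_2 - R_1 = R_1 \left[ (zI - A_1) - (zI - A_2) \right] R_2$ with $R_i = (zI - A_i)^{-1}$. As a quick check, the following sketch evaluates both sides of (2.7) for two random symmetric matrices; the helper names and the random test matrices are illustrative assumptions.

```python
import numpy as np

def resolvent(a, z):
    """(z I - A)^{-1} for a square matrix A and a complex number z outside its spectrum."""
    return np.linalg.inv(z * np.eye(a.shape[0]) - a)

def resolvent_identity_gap(a1, a2, z):
    """Largest entrywise deviation between the two sides of the resolvent identity (2.7)."""
    r1, r2 = resolvent(a1, z), resolvent(a2, z)
    return np.max(np.abs(r2 - (r1 - r1 @ (a1 - a2) @ r2)))

def random_symmetric(n, rng):
    """A real symmetric test matrix with Gaussian entries."""
    a = rng.standard_normal((n, n))
    return (a + a.T) / 2.0
```

Since both test matrices are real symmetric, any $z$ with non-zero imaginary part is outside both spectra and the gap should be at the level of rounding error.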

In addition, we will use the decoupling formula (see for example [31]): for any 
real-valued random variable $\xi$ with $p + 2$ finite moments and $\phi$ a complex-valued 
function with $p + 1$ continuous and bounded derivatives, 

\[ \mathbb{E}(\xi \phi(\xi)) = \sum_{a=0}^{p} \frac{\kappa_{a+1}}{a!} \mathbb{E}(\phi^{(a)}(\xi)) + \epsilon, \tag{2.8} \]

where $\kappa_a$ are the cumulants of $\xi$ and $|\epsilon| \le C \sup_t |\phi^{(p+1)}(t)| \, \mathbb{E}(|\xi|^{p+2})$, where $C$ depends 
only on $p$. We will use the notation $\kappa_{a,ij}$ for the $a$-th cumulant of the $(ij)$-th entry 
of $W_N$. 
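
When $\phi$ is a polynomial of degree at most $p$, the remainder in the decoupling formula vanishes and the cumulant expansion is exact, which makes the formula easy to test. The sketch below does this in exact rational arithmetic for a discrete random variable. The helper names, the moment-to-cumulant recursion used to obtain the $\kappa_a$, and the example distribution are illustrative assumptions and are not taken from the paper.

```python
from fractions import Fraction
from math import comb, factorial

def moments(values, probs, order):
    """Raw moments m_0, ..., m_order of a discrete random variable."""
    return [sum(p * v**n for v, p in zip(values, probs)) for n in range(order + 1)]

def cumulants(m):
    """Cumulants via kappa_n = m_n - sum_{k=1}^{n-1} C(n-1, k-1) kappa_k m_{n-k}."""
    kappa = [Fraction(0)] * len(m)
    for n in range(1, len(m)):
        kappa[n] = m[n] - sum(comb(n - 1, k - 1) * kappa[k] * m[n - k] for k in range(1, n))
    return kappa

def poly_derivative(coeffs):
    """Derivative of a polynomial given by coefficients in increasing degree."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def poly_expectation(coeffs, m):
    """E[phi(xi)] for a polynomial phi, computed from the raw moments."""
    return sum(c * m[k] for k, c in enumerate(coeffs))

def decoupling_sides(values, probs, phi):
    """Return (E[xi * phi(xi)], sum_a kappa_{a+1}/a! * E[phi^(a)(xi)]) for polynomial phi."""
    deg = len(phi) - 1
    m = moments(values, probs, deg + 1)
    kappa = cumulants(m)
    lhs = poly_expectation([Fraction(0)] + list(phi), m)  # multiply phi by x
    rhs, d = Fraction(0), list(phi)
    for a in range(deg + 1):
        rhs += kappa[a + 1] / factorial(a) * poly_expectation(d, m)
        d = poly_derivative(d)
    return lhs, rhs
```

For example, take $\xi = 2$ with probability $1/3$ and $\xi = -1$ with probability $2/3$, which is centered with a non-zero third cumulant, and $\phi(x) = x^3$; both sides then equal $\mathbb{E}\xi^4$.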

The resolvent identity (2.7), $R_{12} = z^{-1} \sum_k X_{1k} R_{k2}$, and the decoupling formula 
(2.8) give 

\[ z \mathbb{E}[R_{12}] = \sigma^2 \mathbb{E}[R_{12} \operatorname{tr}_N(R_N)] + \frac{\sigma^2}{N} \mathbb{E}[(R_N^2)_{12}] + \frac{1}{2 N^{3/2}} \sum_k \kappa_{3,1k} \left( 4 \mathbb{E}[R_{12} R_{1k} R_{kk}] + 2 \mathbb{E}[R_{11} R_{kk} R_{k2}] + 2 \mathbb{E}[R_{1k}^2 R_{2k}] \right) + r_N, \tag{2.9} \]

where $r_N$ contains the $3 \le a \le 6$ terms from the decoupling formula (2.8) and the 
error from truncating at $p = 6$ in the decoupling formula. Additionally, since we 
have set Xu = for 1 < % < N we need to add and subtract ^j-E[RuRi 2 }. To ease 

notation, we also add and subtract — jv<a+ °|/ 2 ,12 E[i?i2-Ri 1 ]. This term allows future 
summations to be over all $k$. Using (2.3), (2.6) and the Cauchy-Schwarz inequality 
we see these additionally added terms are $O(N^{-2})$. In the future, when using the 
decoupling formula we will not mention the error from the diagonal terms. 
To estimate the error terms we use Lemma 9.2 of [15], which states: 

E[R 12 tr N (R N )] - E[i? 12 ]E[tr A ,(R iv )] = O fMIl^fO 

Since the assumptions in Lemma 9.2 are more restrictive than those of this paper, a few 
words are needed to justify the use of this lemma. Its proof follows by applying the 
resolvent identity (2.7) to $R_{12}$ and then applying the decoupling formula (2.8) to 
$\sum_k \mathbb{E}[R_{1k} X_{k2}] \mathbb{E}[\operatorname{tr}_N(R_N(z))]$ and to $\sum_k \mathbb{E}[R_{1k} X_{k2} \operatorname{tr}_N(R_N(z))]$. Then 
$\mathbb{E}[R_{12} \operatorname{tr}_N(R_N)] - \mathbb{E}[R_{12}] \mathbb{E}[\operatorname{tr}_N(R_N)]$ is estimated in a similar manner to the proof of Lemma 2.1. 

Lemma 9.2 assumes that the distribution of the matrix entries satisfies the Poincaré 
inequality (1.15) and is symmetric. The Poincaré inequality assumption gives 
a bound on the variance of the trace and also implies that all moments of the 
distribution of the matrix entries are finite. To bound the variance of the trace we 
can instead use Lemma 2 of [45], which states 

v( trjv(Rw(2) )) = o(M^ 

Even though the result of Lemma 2 in [45] was stated under the assumption that 
both the off-diagonal and the diagonal matrix entries have finite fourth moment, 
the actual proof holds under the assumptions (1.1)-(1.6) in the real symmetric case 
and (1.8)-(1.10) in the Hermitian case [46]. 

Because the entries of $W_N$ have been truncated, the $a$-th moment for $a > 4$ grows 
no faster than $N^{(a-4)/4}$. This bound combined with estimates (2.2) and (2.3) allows 
the higher order error terms in the decoupling formula to be bounded. The proof 
is lengthy but straightforward and very similar to the proof of Lemma 2.1. 

Since we do not assume the distribution is symmetric, the third moment might 
not vanish, but the third cumulant terms in the decoupling formula can be esti- 
mated exactly the same as the fourth cumulant terms in [TS], after noting that the 
summation over k will contribute at most a factor of N 1 / 2 instead of N as in the 
fourth cumulant term. 

Furthermore, part (ii) of Theorem 3.2 in [43] with $f(x) = 1/(z - x)^2$ implies 



The following lemma will help bound the third cumulant terms of (2.9). It is an 
improvement of [44], where the Cauchy-Schwarz inequality was used to give an error 
bound $O(N^{-3/2})$. 

Lemma 2.1. 

rt /, E[) j K3.ik(Rl2RlkRkk + Rk 2 R\\Rkk + R2kR\k)] (2.10) 

k 

= rrij J2 ^ikinR^MRikRkk] + E[R k2 ]E[R n R kk ] + E[R 2k ]E[R 2 lk }) + O ( M^)\ \ 

k ^ 

where $\kappa_{3,ij}$ denotes the third cumulant of the $(ij)$-th entry of $W_N$. 

The proof is in the appendix. Applying (2.4) to the right side of the equation in 
Lemma 2.1 and using uniform boundedness of the $\kappa_{3,1k}$'s we obtain 

7^5^ J2 K ^ k {^[R^RikRkk] + 2E[R 11 R kk R k2 ] + 2E[R\ k R 2k ]) 

k 



This concludes the estimates for the third cumulant terms of (2.9). The fourth 
cumulant terms in (2.9) are: 

\[ \frac{1}{3! N^2} \sum_k \kappa_{4,1k} \mathbb{E}\left[ 18 R_{11} R_{1k} R_{kk} R_{k2} + 6 R_{11} (R_{kk})^2 R_{12} + 18 (R_{1k})^2 R_{kk} R_{12} + 6 (R_{1k})^3 R_{k2} \right]. \]
Each of these terms except the second one are shown to be 0(Pi(\ Im(2)| _1 )iV _2 ) 
in [44] . The arguments in Lemma 12.11 and estimate (|2.2p gives the bound 

l^J2nRn(R kk ) 2 Ri2] = ^£E[i?ii(i4 fe ) 2 Mi? 12 ] + o (MMfT) 

k k ^ 

= () /P 10 (|Im(z)|- 1 ) 



TV 2 

Similarly, using that E[\W i3 -\ 4+a ] < N a / 4 E[\W l3 \ 4 } and a lengthy but straightfor- 
ward calculation the fifth cumulant terms are bounded by O ( j^/4 — - ) ■ The 
sixth cumulant terms can be bounded using the Cauchy-Schwarz inequality. For 
example: 

^ £ E[Rl k Rl lRl2 ) = ^ Y, nRlkRlMRii] + o (^T* ' 

k k ^ 



o 



N 2 



Finally, the seventh cumulant term is 0(Pi{\ Im(z)\~ 1 )N~ 9 / 4: ) and the truncation 
term is the sum of N terms each bounded by 0(Ps\ In^z)!" 1 )^ -3 ). 
Now we continue the study of the leading order terms of Eq. 



zE[i? 12 (z)] = a 2 E[R 12 (z)]E[tr N (R N (z))] + ^E\R 22 \E\R ll R 22 \ [ + O ^' uMhu{ ' ' ! 



jy3/2 L ZAi L 1L iz " V N 2 

Then by the Cauchy-Schwarz inequality and (|2.3p 



K ^E [Rll R 22 ] = ^E[R lin R 22] + O ( mi ^F 1] ) = ^E N (R wW )]^ + O < ^ M ^" 



JY3/2 I lJ - zz ' ^3/2 L 1X J L ^' y N b/2 J N 3/2 L «v ivwyj y ^ 5/2 

Therefore, 

(z - cr 2 E[tr N (R w (z))]) E[i? 12 (z)] = ^E^Ra^))] 3 + O (M^l 
This implies (see e.g. |44j ) that 

The proof is completed. D 

Now we use Proposition 2.2 to prove Theorem 1.4. 
Proof of Theorem 1.4. Let $u_N$ and $v_N$ be unit vectors in $\mathbb{C}^N$. Using (2.1) and 
(2.5), 

AT 

E[u^R A r(z)v A r] = Y^Whi{z)]u>iVi + ^ E[i^- (z)]u^- 

= .g CT (z)5 ip + ^ u ^M 3 (z)v w + 0(P 12 (| ImCz)- 1 )^- 1 ) 
Furthermore, if $\|u_N\|_1 = o(\sqrt{N})$ or $\|v_N\|_1 = o(\sqrt{N})$ then 



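\[ \left| \frac{1}{N^{3/2}} u_N^* M_3(z) v_N \right| \le \frac{1}{N^{3/2}} \sum_{i \ne j} |\mu_{3,ij}| |u_i| |v_j| \le \frac{\max_{i \ne j} |\mu_{3,ij}| \, \|u_N\|_1 \|v_N\|_1}{N^{3/2}} = o(N^{-1/2}). \]

□

Remark 2.1. If for all $1 \le i, j \le N$, $\mu_{3,ij} = \mu_3$, then 

\[ \frac{1}{N^{3/2}} u_N^* M_3(z) v_N = \frac{\mu_3}{N^{3/2}} \sum_{i=1}^{N} \sum_{j=1}^{N} \bar{u}_i v_j (1 - \delta_{ij}) = \frac{\mu_3}{\sqrt{N}} (u_N^* 1_N)(1_N^* v_N) - o(N^{-1/2}), \]

where $1_N = \frac{1}{\sqrt{N}} (1, \dots, 1)^T$. 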



3. Fluctuations of bilinear forms 



This section is devoted to the proofs of Theorems 1.5 and 1.6. As in Section 
2, for notational convenience we set $X_N = \hat{X}_N$ and $R_N(z) = \hat{R}_N(z)$. Recall that 
Proposition 1.1 implies that Theorem 1.5 gives the fluctuations of the outlying 
eigenvalues. Our argument follows the proof of Theorem 3.1 of [3]. We outline the 
proof, showing that the theorem holds in the more general case. 

In order to compute the limiting distribution of quadratic forms it is useful to 
represent sums as Riemann sums of certain integrals. To make this representation 
effective it is useful to permute the elements of the unit vectors. Permuting 
the entries of unit vectors is equivalent to conjugating our Wigner matrix by 
a permutation matrix. The matrix obtained from $W_N$ by conjugation by 
permutation matrices is again a Wigner random matrix satisfying the same hypotheses 
as $W_N$. In order to show that a good permutation of the unit vectors exists we show 
that Lemma 3.1 of [3] can be extended to the case of multiple bilinear forms (see 
Corollary 3.1). 
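
The equivalence between permuting the coordinates of the vectors and conjugating the matrix by a permutation matrix can be verified directly: if $P$ is a permutation matrix, then $\langle Pu, (zI - P X P^T)^{-1} P v \rangle = \langle u, (zI - X)^{-1} v \rangle$. The sketch below checks this for a random symmetric matrix; the function names and test data are illustrative assumptions.

```python
import numpy as np

def resolvent_form(x, u, v, z):
    """<u, (z I - X)^{-1} v>, with the scalar product conjugate-linear in u as in (1.21)."""
    r = np.linalg.inv(z * np.eye(x.shape[0]) - x)
    return np.vdot(u, r @ v)  # np.vdot conjugates its first argument

def permuted_form(x, u, v, z, perm):
    """The same bilinear form after relabelling the coordinates by the permutation perm."""
    p = np.eye(x.shape[0])[perm]  # permutation matrix with (p @ u)[i] = u[perm[i]]
    return resolvent_form(p @ x @ p.T, p @ u, p @ v, z)
```

The two values agree up to rounding for every permutation, which is why the proof is free to reorder the coordinates of the vectors $u_N^l$.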

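Theorem 3.1 ([?5] see Theorem 1 of [26]). Let $\{v_i\}_{i=1}^{N}$ be a finite family of vectors 
in $\mathbb{R}^m$ of size $N$. The elements of $v_i$ are denoted $v_i^l$ for $1 \le l \le m$. Assume that 
$|v_i^l| \le c$ for all $i, l$ and $\sum_{i=1}^{N} v_i = 0$. Then there exist a permutation $\pi \in S_N$ and 
some universal constant $K_m$ depending only on $m$, such that 

\[ \left\| \sum_{i=1}^{[Nt]} v_{\pi(i)} \right\|_\infty \le c K_m \]

for all $0 \le t \le 1$, where $\|v\|_\infty = \max_{1 \le l \le m} |v^l|$ for $v = (v^1, v^2, \dots, v^m)$. 

We also have the following corollary. 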

We also have the following corollary. 

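Corollary 3.1. Let $u^1, \dots, u^k$ be a set of orthonormal vectors in $\mathbb{C}^N$. There exist 
a constant $C$, depending only on $k$, and a permutation $\pi \in S_N$ such that 

\[ \left| \sum_{i=1}^{\lfloor tN \rfloor} \bar{u}^l_{\pi(i)} u^p_{\pi(i)} - \frac{\lfloor tN \rfloor}{N} \delta_{lp} \right| \le C \max_{l,i} |u^l_i| \]

for all $0 \le t \le 1$ and $1 \le l, p \le k$. 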
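Proof. For each $N$, consider the family $\{v_i\}_{i=1}^{N}$ such that $v_i \in \mathbb{R}^{k^2}$ with entries 
$|u_i^l|^2 - \frac{1}{N}$ for $1 \le l \le k$, and $\operatorname{Re}(\bar{u}_i^l u_i^p)$ and $\operatorname{Im}(\bar{u}_i^l u_i^p)$ for $1 \le l < p \le k$. Note that for 
$1 \le q \le k^2$ and $1 \le i \le N$, $|v_i^q| \le \max_{l,i} |u_i^l|$. 

Then by Theorem 3.1 there exists a permutation such that 

\[ \left| \sum_{i=1}^{\lfloor tN \rfloor} \bar{u}^l_{\pi(i)} u^p_{\pi(i)} - \frac{\lfloor tN \rfloor}{N} \delta_{lp} \right| \le \left| \sum_{i=1}^{\lfloor tN \rfloor} \left( \operatorname{Re}(\bar{u}^l_{\pi(i)} u^p_{\pi(i)}) - \frac{\delta_{lp}}{N} \right) \right| + \left| \sum_{i=1}^{\lfloor tN \rfloor} \operatorname{Im}(\bar{u}^l_{\pi(i)} u^p_{\pi(i)}) \right| \le 2 \left\| \sum_{i=1}^{\lfloor tN \rfloor} v_{\pi(i)} \right\|_\infty \le 2 K_{k^2} \max_{l,i} |u^l_i|. \]

□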



Now we show that the arguments of [5] can be extended to the case of multiple bilinear forms. The first step is to write the entries of $G_N(z)$ as a martingale difference sequence. After the error terms are estimated, the martingale central limit theorem is used. The bulk of the proof will be devoted to computing the covariance of the limiting Gaussian distribution.

Proof of Theorem 1.5. Recall that $G_{pl}(z) = \sqrt{N}\,\big(u_N^{p*} R_N(z) u_N^l - \mathbb{E}[u_N^{p*} R_N(z) u_N^l]\big)$. By the symmetry of $G_N(z)$ we prove the limiting joint distribution for $G_{pl}(z)$ for $1 \le p \le l \le m$. By the Cramér-Wold theorem it is sufficient to study arbitrary linear combinations of the real and imaginary parts of the matrix elements of $G_N$. So we consider:

$$\sum_{1 \le l \le p \le m} \sum_{h} \mathrm{Re}(a_{p,l,h})\, \mathrm{Re}(G_{pl}(z_{p,l,h})) + \mathrm{Im}(a_{p,l,h})\, \mathrm{Im}(G_{pl}(z_{p,l,h})) \qquad (3.1)$$

for arbitrary complex numbers $a_{p,l,h}$. We begin by decomposing $G_{pl}(z)$ into a martingale difference sequence:

$$G_{pl}(z) = \sqrt{N} \sum_{k=1}^{N} (\mathbb{E}_{k-1} - \mathbb{E}_k)\, u_N^{p*} R_N(z) u_N^l$$

where $\mathbb{E}_k$ is conditional expectation given $W_{ij}$ for $k < i, j \le N$ and $\mathbb{E}_N$ is $\mathbb{E}$.
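
The telescoping structure of this decomposition can be illustrated on a toy model. The sketch below is an assumption-laden analogue, not the matrix setting of the proof: it replaces the matrix entries by $n = 4$ independent $\pm 1$ variables, takes an arbitrary test function `f`, and lets `cond_exp(k, x)` average over the first $k$ variables, mirroring the convention that $\mathbb{E}_k$ conditions on the variables with index larger than $k$. It then checks exactly that the differences sum to $f - \mathbb{E}f$ and are pairwise uncorrelated.

```python
from fractions import Fraction
from itertools import product

n = 4


def f(x):
    # Arbitrary test function of n independent +-1 variables (hypothetical).
    return (x[0] + x[1] * x[2] + 2 * x[3]) ** 2 + x[0] * x[3]


def cond_exp(k, x):
    """E_k f at x: average over x_1..x_k, keeping x_{k+1}..x_n fixed."""
    heads = product((-1, 1), repeat=k)
    total = sum(Fraction(f(head + tuple(x[k:]))) for head in heads)
    return total / 2 ** k


def difference(k, x):
    """k-th martingale difference (E_{k-1} - E_k) f, for k = 1..n."""
    return cond_exp(k - 1, x) - cond_exp(k, x)


points = list(product((-1, 1), repeat=n))
mean_f = cond_exp(n, points[0])  # E_n is the full expectation

# The differences telescope to f - E f at every point.
telescopes = all(
    sum(difference(k, x) for k in range(1, n + 1)) == f(x) - mean_f
    for x in points
)


def cov(j, k):
    """Exact expectation of the product of the j-th and k-th differences."""
    return sum(difference(j, x) * difference(k, x) for x in points) / len(points)


orthogonal = all(cov(j, k) == 0 for j in range(1, n + 1) for k in range(j + 1, n + 1))
print(telescopes, orthogonal)
```

The orthogonality of the differences is what allows the variance of the sum to be computed term by term, which is the role played by condition (3.3) below.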

In order to show that the limiting distribution of (3.1) is Gaussian we use the following central limit theorem for martingale difference sequences [13].

Theorem 3.2 (Theorem 35.12 in [13]). For each $N$, let $Y_{Nk}$ be a martingale difference sequence with respect to $\mathcal{F}_{N,1}, \mathcal{F}_{N,2}, \ldots$. Let

$$\sigma_{Nk}^2 = \mathbb{E}[Y_{Nk}^2 \mid \mathcal{F}_{N,k-1}]. \qquad (3.2)$$

Suppose that as $N$ goes to infinity

$$\sum_{k=1}^{\infty} \sigma_{Nk}^2 \to_P \sigma^2, \qquad (3.3)$$

and

$$\sum_{k=1}^{\infty} \mathbb{E}\big[Y_{Nk}^2 1_{\{|Y_{Nk}| \ge \epsilon\}}\big] \to 0 \qquad (3.4)$$

for each $\epsilon > 0$. Then

$$\sum_{k=1}^{\infty} Y_{Nk} \to_D N(0, \sigma^2). \qquad (3.5)$$

Before we continue with the proof we introduce some notation. We use $e_j$ to denote the $j$th canonical unit vector, $w_k$ to denote the $k$th column of the matrix $X_N = \frac{1}{\sqrt{N}} W_N$, and $w_j^{(k)}$ to denote the $j$th column of the matrix $X_N$ with the $k$th entry set to zero. Similarly, by $R_N^{(k)}$ (resp. $R_N^{(jk)}$) we denote the resolvent of $X_N - e_k w_k^* - w_k e_k'$ (resp. $X_N - e_k w_k^* - w_k e_k' - e_j w_j^{(k)*} - w_j^{(k)} e_j'$). Finally $u_k^l$ is the $k$th entry of the vector $u_N^l$. We use $o_P(1)$ for terms that converge to zero in probability as $N \to \infty$.

For the reader's convenience we contrast our notation with that of [3]. They use $-A^{-1}$ (resp. $-A_k^{-1}$, $-A_{kj}^{-1}$) for $R_N(z)$ (resp. $R_N^{(k)}(z)$, $R_N^{(jk)}(z)$). Their unit vectors are denoted $x_N$ and $y_N$ instead of $u_N^l$. The vector $w_j$ with the $k$th element set to $0$ is denoted $w_{jk}$ instead of our $w_j^{(k)}$. Finally the Stieltjes transform of the semi-circle law is denoted $s(z)$ instead of $g_\sigma(z)$.
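
For readers who want to experiment numerically with $g_\sigma(z)$, here is a minimal sketch. It assumes the convention $g_\sigma(z) = \int \frac{d\mu_{sc}(x)}{z - x}$, so that $g_\sigma$ solves $\sigma^2 g^2 - z g + 1 = 0$ and decays like $1/z$; if the paper's sign convention differs, the root selection below would need to be adapted. The function name `g_sigma` is hypothetical.

```python
import cmath


def g_sigma(z, sigma=1.0):
    """Stieltjes transform of the semicircle law with variance sigma^2.

    Assumed convention: g(z) = int rho(x) / (z - x) dx for Im z != 0.
    """
    root = cmath.sqrt(z * z - 4 * sigma ** 2)
    candidates = [(z - root) / (2 * sigma ** 2), (z + root) / (2 * sigma ** 2)]
    # The two roots of sigma^2 g^2 - z g + 1 = 0 have product 1 / sigma^2,
    # so exactly one has |sigma * g| < 1; that is the one decaying like 1/z.
    return min(candidates, key=abs)


sigma = 1.5
z = 0.3 + 0.2j
g = g_sigma(z, sigma)
residual = sigma ** 2 * g ** 2 - z * g + 1
print(g, abs(residual), abs(sigma * g))
```

The inequality $|\sigma g_\sigma(z)| < 1$ off the real axis is what keeps denominators of the form $1 - \sigma^2 \frac{N-k}{N} g_\sigma(z_1) g_\sigma(z_2)$ away from zero later in this section.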

The first observation to be made is that $(\mathbb{E}_{k-1} - \mathbb{E}_k) R_N^{(k)}(z) = 0$, so it can be subtracted from the martingale decomposition of $G_{pl}(z)$. Then the resolvent identity (2.7) is used to compare $R_N(z)$ and $R_N^{(k)}(z)$, leading to

$$G_{pl}(z) = \sqrt{N} \sum_{k=1}^{N} (\mathbb{E}_{k-1} - \mathbb{E}_k)\, u_N^{p*} R_N(z) (w_k e_k' + e_k w_k^*) R_N^{(k)}(z) u_N^l .$$

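Then $R_N(z)$ can be further expanded by twice applying the identity (see for example [3] Eq. 2.3)

$$(B + u v^*)^{-1} = B^{-1} - \frac{B^{-1} u v^* B^{-1}}{1 + v^* B^{-1} u} \qquad (3.6)$$

for any invertible matrix $B$ and vectors $u, v$. We apply it first with $(u, v) = (e_k, w_k)$ and $B = z - X_N$, and then with $(u, v) = (w_k, e_k)$ and $B = z - (X_N - e_k w_k^*)$. We remind the reader that $X_{ii} = 0$ for $1 \le i \le N$.
This leads to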



G pl (z) = VNj2^k-i-E k ) ^— (4<R^-4<+«*^+<R^W£R^M 

fc z - w^Rjv'wfc 
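
The rank-one update identity invoked before the last display is, as far as the surrounding text indicates, of Sherman-Morrison type. The sketch below checks the standard Sherman-Morrison formula $(B + uv^T)^{-1} = B^{-1} - \frac{B^{-1} u v^T B^{-1}}{1 + v^T B^{-1} u}$ in exact rational arithmetic on a small real example. The matrix `b`, the vectors `u`, `v`, and the helper names are assumptions made for illustration only.

```python
from fractions import Fraction


def inverse(a):
    """Gauss-Jordan inverse of a small square matrix, in exact arithmetic."""
    n = len(a)
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [x / aug[col][col] for x in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def sherman_morrison(b, u, v):
    """Inverse of b + u v^T computed from the inverse of b."""
    n = len(b)
    b_inv = inverse(b)
    b_inv_u = [sum(b_inv[i][j] * u[j] for j in range(n)) for i in range(n)]
    v_b_inv = [sum(v[i] * b_inv[i][j] for i in range(n)) for j in range(n)]
    denom = 1 + sum(v[i] * b_inv_u[i] for i in range(n))
    return [[b_inv[i][j] - b_inv_u[i] * v_b_inv[j] / denom for j in range(n)]
            for i in range(n)]


# Hypothetical 3x3 example with integer entries.
b = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
u = [1, 0, 2]
v = [0, 1, 1]

direct = inverse([[b[i][j] + u[i] * v[j] for j in range(3)] for i in range(3)])
via_formula = sherman_morrison(b, u, v)
print(direct == via_formula)
```

Applying such an update twice, once for each of the two rank-one pieces $e_k w_k^*$ and $w_k e_k'$, is what relates $R_N(z)$ to $R_N^{(k)}(z)$.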

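Checking the estimates in [3] from Eq. (4.1) to Eq. (4.4) leads to

$$G_{pl}(z_{p,l,h}) = \sqrt{N}\, g_\sigma(z_{p,l,h}) \sum_{k=1}^{N} \mathbb{E}_{k-1}\big[\xi_k^{p,l}(z_{p,l,h}) + \psi_k^{p,l}(z_{p,l,h}) + \eta_k^{p,l}(z_{p,l,h})\big] + o_P(1)$$

where:

$$\xi_k^{p,l}(z) := u_k^l\, u_N^{p*} R_N^{(k)}(z) w_k \quad \text{and} \quad \psi_k^{p,l}(z) := \bar{u}_k^p\, w_k^* R_N^{(k)}(z) u_N^l$$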



rf/(z) := wJE^H^RSJ'Ww. - ^« (R^(z))\ l N + ») 

It is easily checked that (3.1) satisfies condition (3.4). The following lemma considers approximations of the variance terms in (3.3) and computes the limits of these approximations. The sketch of its proof will take the majority of the remainder of this section.

Lemma 3.1. Let $u_N^p$, $u_N^q$, $u_N^r$, and $u_N^l$ be unit $N$-dimensional vectors such that any two vectors from this group are either orthogonal or the same. In addition, let us assume that the $\|\cdot\|_\infty$ norm of at least one vector in each pair $u_N^l, u_N^p$ and $u_N^q, u_N^r$ goes to zero as $N \to \infty$. Then the following holds

N^E k \E k -.i[$ (^ 1 )]E fe _ 1 [^' (z 2 )]]=p^ , n2N - kn ( -J ( , \ +°p^ 

/t = i fc=i 1 " n y<y( z i)ycr\ z 2) 

(3.7) 

(3.8) 

(3.9) 
* — ■> 7 *-— > o~ 2 Qrriz-i )q rr (z2)u[,u1 y^ ^1 u p u r - 

N^k^-iitfiziWk-ityr^)}] = E y Tn- VwV +op(1) 

(3.10) 

AT 

JVX)Efc[Efe-i[^ , '(«i)]E*-i[^' r («a)]] = (3-11) 

fc=i 

p ^ g 4 gg(^i)gg(^)E t >fc^"fE 3 >fc"j M j 1 A g 4 gg(zi)gg(z2)E i>fc *ff<E 3>fc iffi4 m 
^tl (l-^ 2 ^)^)) 2 JV^j (l-^^CO&rte)) 2 ° H) 

JV 

iV^E t [E t _ 1 [Cr 1 (z 1 )]EnK^2)]]+EnW J ( z i)]EnK^2)]] (3.12) 
fe=i 

+ E fe _ 1 [^(z 1 )]E,_ 1 [Cr^ 2 )] +E fc _ 1 [^'(z 1 )]E fc _ 1 [^'(z 2 )] = op(1), 

where $o_P(1)$ stands for terms that converge to zero in probability as $N \to \infty$.
Additionally, as $N \to \infty$



2a 2 ffff (z 1 ) 5CT (z 2 Ku£XV > fc u K 1 ^cr 4 5 - 2 (z 1 ) ff 2 (z 2 )X; i>fe «Ei>fc' u j 






->Mgl(- 1 + 12 ... - ~ )• ( 3A3 ) 



fc=1 - - at y<7\~L>y<7W/ fe=] V- v 

1 

1 - v 2 ga(zi)g<r(z 2 ) 

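Proof. We begin by proving (3.7).

$$N\, \mathbb{E}_k\big[\mathbb{E}_{k-1}[u_k^l u_N^{p*} R^{(k)}(z_1) w_k]\, \mathbb{E}_{k-1}[u_k^r u_N^{q*} R^{(k)}(z_2) w_k]\big]$$
$$= N\, \mathbb{E}_k\big[(u_k^l u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k w_k)(w_k^T I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)^T] \bar{u}_N^q u_k^r)\big]$$
$$= \sigma^2 \rho\, u_k^l u_k^r\, u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q$$

Here $I_k := \sum_{i>k} e_i e_i'$, and $\rho = 1$ if $X_N$ is real and $\rho = 0$ if $X_N$ is complex.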
We then apply the following approximation and its conjugate, from [3], to $u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q$:

$$e_j' R^{(k)}(z) u_N = g_\sigma(z)\big(u_j + w_j^{(k)*} R^{(jk)}(z) u_N\big) + o_P(1) \qquad (3.14)$$
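
Behind an approximation of the form (3.14) there is an exact algebraic identity, obtained from the block inverse (Schur complement) formula: for a symmetric matrix $X$ with zero diagonal and $R = (z - X)^{-1}$, one has $e_j' R u = R_{jj}\,(u_j + w_j^* R^{(j)} u)$, where $w_j$ is the $j$th column of $X$ and $R^{(j)}$ is the resolvent of $X$ with its $j$th row and column set to zero. The approximation then presumably consists of replacing $R_{jj}$ by $g_\sigma(z)$. The sketch below verifies the exact identity in rational arithmetic; the $4 \times 4$ matrix, the vector `u`, the value of `z`, and all helper names are assumptions chosen for illustration.

```python
from fractions import Fraction as F


def inverse(a):
    """Gauss-Jordan inverse of a small square matrix, in exact arithmetic."""
    n = len(a)
    aug = [[F(x) for x in row] + [F(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [x / aug[col][col] for x in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def resolvent(x, z):
    """(z I - x)^{-1} for a small matrix x."""
    n = len(x)
    return inverse([[(z if i == j else 0) - x[i][j] for j in range(n)]
                    for i in range(n)])


# Hypothetical symmetric matrix with zero diagonal, and a test vector.
x = [[F(0), F(1, 2), F(-1, 3), F(1, 4)],
     [F(1, 2), F(0), F(1, 5), F(-1, 2)],
     [F(-1, 3), F(1, 5), F(0), F(1, 3)],
     [F(1, 4), F(-1, 2), F(1, 3), F(0)]]
u = [F(1), F(-2), F(3), F(1, 2)]
z, j, n = F(3), 1, 4

w = [x[a][j] for a in range(n)]  # j-th column of x (its j-th entry is zero)
x_minor = [[F(0) if j in (a, b) else x[a][b] for b in range(n)] for a in range(n)]
r, r_minor = resolvent(x, z), resolvent(x_minor, z)

lhs = sum(r[j][a] * u[a] for a in range(n))
rhs = r[j][j] * (u[j] + sum(w[a] * r_minor[a][b] * u[b]
                            for a in range(n) for b in range(n)))
print(lhs == rhs)
```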

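After this approximation is applied, it is shown in [3] between Eq. (4.9) and Eq. (4.15) that the cross terms are $o_P(1)$. Their analysis carries over directly, leading to

$$u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q \qquad (3.15)$$

$$= g_\sigma(z_1) g_\sigma(z_2) \sum_{j>k} \Big( \bar{u}_j^p \bar{u}_j^q + u_N^{p*} \mathbb{E}_{k-1}[R^{(jk)}(z_1) w_j^{(k)}]\, \mathbb{E}_{k-1}[w_j^{(k)*} R^{(jk)}(z_2)] \bar{u}_N^q \Big) + o_P(1)$$

$$= g_\sigma(z_1) g_\sigma(z_2) \sum_{j>k} \Big( \bar{u}_j^p \bar{u}_j^q + u_N^{p*} \mathbb{E}_{k-1}[R^{(jk)}(z_1)] I_k w_j^{(k)} w_j^{(k)*} I_k \mathbb{E}_{k-1}[R^{(jk)}(z_2)] \bar{u}_N^q \Big) + o_P(1)$$

$$= g_\sigma(z_1) g_\sigma(z_2) \sum_{j>k} \Big( \bar{u}_j^p \bar{u}_j^q + \frac{\sigma^2}{N}\, u_N^{p*} \mathbb{E}_{k-1}[R^{(jk)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(jk)}(z_2)] \bar{u}_N^q \Big) + o_P(1) \qquad (3.16)$$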



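where on the last line, Lemma 8.2 of [3] is used to show that $I_k w_j^{(k)} w_j^{(k)*} I_k$ can be approximated by $N^{-1} \sigma^2 I_k$. Then from Eq. (4.16) to Eq. (4.23) it is shown that $R^{(jk)}(z)$ can be replaced with $R^{(k)}(z)$ by applying (3.6) twice and using the Cauchy-Schwarz inequality on the error terms, leading to

$$u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q \qquad (3.17)$$

$$= g_\sigma(z_1) g_\sigma(z_2) \Big( \sum_{j>k} \bar{u}_j^p \bar{u}_j^q + \sigma^2 \frac{N-k}{N}\, u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q \Big) + o_P(1)$$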



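Solving for $u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q$, multiplying by $u_k^l u_k^r$ and summing over $k$ gives

$$\sum_{k=1}^{N} u_k^l u_k^r\, u_N^{p*} \mathbb{E}_{k-1}[R^{(k)}(z_1)] I_k \mathbb{E}_{k-1}[R^{(k)}(z_2)] \bar{u}_N^q = \sum_{k=1}^{N} \frac{g_\sigma(z_1) g_\sigma(z_2)\, u_k^l u_k^r \sum_{j>k} \bar{u}_j^p \bar{u}_j^q}{1 - \sigma^2 \frac{N-k}{N} g_\sigma(z_1) g_\sigma(z_2)} + o_P(1) \qquad (3.18)$$

The proofs of (3.8) - (3.10) follow similarly.
The proof of (3.11) follows [3] from Eq. (4.24) to Eq. (4.32).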
NE k [E k _ 1 [rf k ' l (z 1 )]E k _ l [r,l' r (z 2 )}] 
= AnEfcp^iIwJRW^iJu^u^R^J^Owfc - -^(u£R« a (*i)uk + u p k u l k /zf)} 

EnKRWf^u^u^Wf^w, - l(u^R fc ) 2 (z 2 )u^ +^^/z 2 2 )]] 

- ^a 4 ^ e;E M [RW(z 1 )u i N u p ;RW(z 1 )]e j e;En[R( fc '(z 2 )u r N u';RW(z 2 )]e, 

i,j>k 

+ £<7 4 ]T e^ fe _ 1 [R«(z 1 )u5 v u^R( fc )(z 1 )]e J e^E fc _ 1 [R( fe )(z 2 )u^u«;RW(z 2 )]e J 

i,j>k 



E E[l ^ |4] ^ (2 + P)g4 e;E t _ 1 [RW(z 1 )uj y u^RW(z 1 )]e i e;E fc _ 1 [RW(z a )u^ul5rRW(^)]e j 



i>fc 



When summed over $k$ the final term is shown to be $o_P(1)$ in [3]. Using the uniform bound on $\mathbb{E}[|W_{ik}|^4]$, the proof in [3] also holds in this case. We apply (3.14) and its conjugate as before to the first term; the second term follows similarly. As before, the cross terms are shown to be error terms in [3].

Y e:E fc _ 1 [R( fe )(z 1 )u^ vU P;R«(z 1 )]e J e;-E fc _ 1 [R( fe )(z 2 )u^u^RW(^)]e J 

i 7 j>k 

= g a { Zl )g a {z 2 ) J2 (e^ k _ 1 [R^(z 1 )n l N (u p j +xi^R^(z 1 )wf ) )} 

i,j>k 

E*_i[(«S + wf ''RWfeluMu^ROfelej) + o P (l) 
= g a {zi)g a {z2) J2 (^u p j e i E k _ 1 [R^(zi)u l N ]E k _ 1 [u' 1 *R^(z 2 )]e i 

i,j>k 

+ e^ fe _ 1 [R«(z 1 )u^u^R^ fe )(z 1 )]I fe wf ) wf ) *I fc E fc _ 1 [R^(z 2 )u^ v u^R( fe )(2 2 )]e i )+ 0p (l) 



Then Lemma 8.2 of [3] can be used to replace $I_k w_j^{(k)} w_j^{(k)*} I_k$ with $N^{-1} \sigma^2 I_k$, and $R^{(jk)}(z)$ can be replaced by $R^{(k)}(z)$ by applying (3.6) twice and using the Cauchy-Schwarz inequality on the error terms. This leads to



= ffCT (z 1 ) 5CT (z 2 ) J2 (u r j u p j e' i E k _ 1 [R^(z l )u l N }E k _ 1 [u%R^(z2)}e i 

i,j>k 
2 \ 

= gMg a (z2) [E^E fc _ 1 [u^R«( Z2 )]I fc E fc _ 1 [R( fe )(z 1 )u5 v ] 



7V-fc 



a 2 J2 e:E fc _ 1 [RW(z 1 )u>^RW(z 1 )]e J e;.En[R«(z 2 )u>«;RW(z 2 )]e,; +o P (l) 



TV 

i,j>k 
In the last line, the first term is $g_\sigma(z_1) g_\sigma(z_2) \sum_{j>k} u_j^r \bar{u}_j^p$ times (3.15) and the second term is $g_\sigma(z_1) g_\sigma(z_2)\, \sigma^2 (N-k)/N$ times the original expression. Solving for the original expression completes the proof of (3.11).

The remaining terms, (3.12), contain one $\eta$ and one $\psi$ or $\xi$. Each term goes to zero in probability, for example:



E E nWjk ^ jk] E fe [uiuyE^x [Rff (zOle^E^ pEtffi feKuffRff (z 2 )]e 






C* 
< 



^=max|u^E fc _ 1 [RW(z 1 )]e J |^4l|IE fe -i[R^ ) (^)u5 v ||||u^R( v fc) (z 2 )]|| 

This goes to zero because $\max_j |u_N^{p*} \mathbb{E}_{k-1}[R_N^{(k)}(z_1)] e_j| = o_P(1)$, as shown in [5] Eq. (4.31). The other terms are similar.

This completes the proof of the first part of the lemma. We now turn to computing the limit of the sums in (3.13).

We begin with the case where the $\|\cdot\|_\infty$ norm of all vectors goes to zero, and assume that the elements of each vector $\{u^l\}_{1 \le l \le k}$ have been permuted by a permutation given in Corollary 3.1. Permuting the entries of each vector $\{u^l\}_{1 \le l \le k}$ is equivalent to conjugating $W_N$ by a permutation matrix, which returns a Wigner random matrix.

If $u_N^q \ne u_N^l$ then by orthogonality and Corollary 3.1, $\big|\sum_{j>k} \bar{u}_j^q u_j^l\big| \le K_m \max_{i,a} |u_i^a|$. Using the Cauchy-Schwarz inequality shows that the first term in (3.13) is bounded by
E"K , Nk J f 7 4 , r <^ ro maxKl^|uKI<^maxK|||u^||||u^|| 
k l ~ ~7T a 9A z i)9A z 2) ^ a k 

A similar argument works to bound the second term of (3.13).

If $u_N^q = u_N^l$ but $u_N^p \ne u_N^r$ then we use the following summation by parts formula to bound the sum.

$$\sum_{k=1}^{N} g_k f_k = f_1 \sum_{k=1}^{N} g_k + \sum_{k=1}^{N-1} (f_{k+1} - f_k) \sum_{j=k+1}^{N} g_j$$
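
The summation by parts formula above is a purely algebraic identity, and a quick exact check on arbitrary data may help in keeping track of the index ranges. The sequences `f` and `g` below are hypothetical; the function names are chosen only for this sketch.

```python
from fractions import Fraction


def direct_sum(f, g):
    """Left-hand side: sum over k of g_k f_k."""
    return sum(fk * gk for fk, gk in zip(f, g))


def by_parts(f, g):
    """Right-hand side: f_1 * (sum of all g) + sum of increments of f times tails of g."""
    n = len(f)
    # tails[k] = g_{k+1} + ... + g_N in the 1-based indexing of the formula.
    tails = [sum(g[k:]) for k in range(n + 1)]
    return f[0] * tails[0] + sum((f[k] - f[k - 1]) * tails[k] for k in range(1, n))


# Arbitrary test sequences (assumptions for illustration).
f = [Fraction(1, k + 2) for k in range(7)]
g = [Fraction((-1) ** k * (k + 1), 3) for k in range(7)]

print(direct_sum(f, g), by_parts(f, g))
```

In the application, the tail sums of $g$ are the quantities controlled by Corollary 3.1, while the increments $f_{k+1} - f_k$ are small, which is why the rearranged sum is easier to bound.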

The formula is applied with g k = u p k u r k and fk = - — N - k 3 ^. k 3 — ; — r> Using the 
estimate 





E^+iKT 


1- 


- E^l^g (T (z l )g a (z 2 ) 



£j 



>fc i u j 



uV 2 



l-E^i a 2 gAzi)gAz2) 
j>k 



1 - Zj±a* gi ,{zi)9M 



«|2 \M«,9|2_2 



EKi 



Wfcl - > W\ a 



ga{zi)g G (z 2 ) 



N-(N-k)a 2 g a (z 1 )g a (z 2 ) 
with the summation by parts formula gives: 



J2 u k u kY 



E 



3>k \ U<1 3 \ 2 



M ^(^)fcfe) 



TV 



< 



Ej>l \ U J 



<l\ 2 



1 - ^N L ^ 2 9cr(Zl)ga(z2) 



Km max I uf I 



AT-1 

E c h 
fe=i 



9|2 



C 

iV 



JV-1 



EEKi 
fc=i j>fe 



i^rr, max I uf I 



The right side of the last inequality goes to zero in the limit. The above arguments can be adapted to the case when the $\|\cdot\|_\infty$ norm of at least one of the vectors in each pair $u_N^l, u_N^p$ and $u_N^q, u_N^r$ goes to zero as $N \to \infty$.

We now return to the case where the $\|\cdot\|_\infty$ norm of all the vectors converges to zero. If $p = r$ and $q = l$ then as $N \to \infty$ the Riemann sums converge to the following integrals.



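$$\sum_{k=1}^{N} \frac{\sigma^2 g_\sigma(z_1) g_\sigma(z_2)\, |u_k^p|^2 \sum_{j>k} |u_j^q|^2}{1 - \sigma^2 \frac{N-k}{N} g_\sigma(z_1) g_\sigma(z_2)} \to \int_0^1 \frac{\sigma^2 (1-t)\, g_\sigma(z_1) g_\sigma(z_2)}{1 - \sigma^2 (1-t)\, g_\sigma(z_1) g_\sigma(z_2)}\, dt$$

$$\frac{1}{N} \sum_{k=1}^{N} \frac{\sigma^4 g_\sigma^2(z_1) g_\sigma^2(z_2) \sum_{i>k} |u_i^p|^2 \sum_{j>k} |u_j^q|^2}{\big(1 - \sigma^2 \frac{N-k}{N} g_\sigma(z_1) g_\sigma(z_2)\big)^2} \to \int_0^1 \frac{\big(\sigma^2 g_\sigma(z_1) g_\sigma(z_2) (1-t)\big)^2}{\big(1 - (1-t)\, \sigma^2 g_\sigma(z_1) g_\sigma(z_2)\big)^2}\, dt$$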

Computing the integrals proves the lemma. 
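
The passage from sums to integrals can be illustrated numerically in the simplest delocalized case. The sketch below assumes the perfectly flat vector $|u_k|^2 = 1/N$ (so that $\sum_{j>k}|u_j|^2 = (N-k)/N$) and a real parameter $c \in (0,1)$ standing in for $\sigma^2 g_\sigma(z_1) g_\sigma(z_2)$, which in the paper is complex with modulus less than one. Under these assumptions the integral $\int_0^1 \frac{c(1-t)}{1 - c(1-t)}\,dt$ has the closed form $-1 - \frac{\log(1-c)}{c}$, and the Riemann sum can be compared against it.

```python
import math


def riemann_sum(c, n):
    """Sum over k of (1/N) * c(1 - k/N) / (1 - c(1 - k/N)), i.e. |u_k|^2 = 1/N."""
    total = 0.0
    for k in range(1, n + 1):
        s = 1.0 - k / n
        total += c * s / (1.0 - c * s) / n
    return total


def integral(c):
    """Closed form of int_0^1 c(1-t) / (1 - c(1-t)) dt for 0 < c < 1."""
    return -1.0 - math.log(1.0 - c) / c


c = 0.5  # hypothetical stand-in for sigma^2 * g(z1) * g(z2)
approx = riemann_sum(c, 10_000)
exact = integral(c)
print(approx, exact, abs(approx - exact))
```

With a general delocalized vector the weights $|u_k|^2$ are not equal, and it is the permutation from Corollary 3.1 that makes the partial sums behave like those of the flat vector.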

Now we consider the case $|u_k^p|^2 = \sum_{i=1}^{r} \delta_{k\, a_i(N)}\, b_i$ for some finite $r$ with $\sum_i b_i = 1$; then this sum can also be computed.



A' 



,P|2 



^VOi)^^) E ', 

Vfe=i x - ° 

N „\„2( „\„1i 



E 



,912 



N 



3>k l"jl 



^ KI 2 E J>fc KI 



^ft^i)^) 



1 " a 4 . 9 g(z 1 ) g g(z 2 )E,> fc KI 2 E 3 > fc KI 



N 



fc=l 



(1 - ^a 2 ^)^)) 2 



E 6 * ( ^ 9o{z x )g a {z 2 ) I — - 



1 - ffli/JV 



-a 2 ffff (z 1 ) 5CT (z 2 )(l-a i /iV) 



"iV^Wl- (1 



^gKz^gUz.Kl-k/N) 



f^[ (1 - (1 - k/n)a 2 g a (zi)g a (z 2 )) 2 



E i_ cr ; 



1/7V 



fc=i 



<7 CT (*i)<7 CT (z 2 )(l - */#) 



The sequence $a_i(N)/N$ does not necessarily converge, but it is bounded. Let $A$ be a sub-sequential limit. Along this subsequence, the above term converges to



^b l a 2 g G {z l )g l7 { 



I- A 



*2 



1 - o- 2 5 <T (zi)5 -(z 2 )(l - A) J 1 - a 2 g a (zi)g<j(z 2 )(l - t) 



dt 



< 7 4 gg(z 1 ) g g(z 2 )(l-t) 

/ (l-(l-t)a 2 g4 Zl )g a (z 2 )) 2 - 
Computing the integrals shows that the term is independent of A. 
Furthermore we can consider arbitrary $u^p$. We begin by permuting the entries so they are non-increasing. Then there exists some $M$ such that for all $m > M$, $u_m^p \to 0$. By linearity the above analysis can be applied separately to the part which goes to zero and the part which does not, and the two can then be combined for the desired result.

□ 

Now we conclude the proof of Theorem 1.5 by noting that Lemma 3.1 along with the martingale central limit theorem (Theorem 3.2) implies that the centered (3.1) converges in distribution to a Gaussian random variable with variance:

^ 2^ 7^—p(Jl{ziph,ziph>)+'R(ziph,ziph>)) + (n{zi p h,ziph>) +H{zi P h,ziph')) 

Iphh' 

+ Sip{p(U(ziph,zi p h') +~tt(~ziph,zi p h>)) + (n{zi p h, ziph') +K(ziph,zi p h'))) 
Rc(a lph ) lm{ai p h') 



p(U(zi p h, ziph') -Ti{ziph,zi p h') + (-~tt{zi p h, zi ph >) +TL(ziph,zi p h')) 
T i P h') + H(ziph,ziph')) + (Hiziph, ziph') - n-(ziph,ziph'))) 

p{U(z lph , ziph') + ^{zip h ^ip h >)) + (-TI(ziph,ziph>) - Hi'ziph, ziph')) 



2 2 

Si p {p(-IL(ziph,z~ip h ') -Ii(zi p h,ziph')) + (n(zi p h, zip hl ) +'R(ziph,ziph>))) 



Recall that $\Pi(z_1, z_2)$ was defined in (1.29). The proof of Theorem 1.5 is complete.

□



Now, we turn our attention to Theorem 1.6.
Proof of Theorem 1.6. Denote

$$\|f\|_{n,1} := \max\Big( \int_{-\infty}^{+\infty} \Big| \frac{d^l f}{dx^l}(x) \Big|\, dx,\ 0 \le l \le n \Big). \qquad (3.19)$$



If $\|f\|_{5,1} < \infty$, the Gaussian fluctuations for the entries in (1.32) follow from Theorem 1.5 and the bound

V («, f(X N )u p N )) < ConstMhl (3.20) 

(equation (1.33) in Theorem 1.6 in [44]) by a standard approximation argument (see e.g. the last three paragraphs in Section 4 of [10]). It should be noted that (3.20) follows from the bound (2.4) in Proposition 2.1 in [33], i.e.

V((ui V) R N (zK})=0(*^) (3.21) 

by applying the Helffer-Sjöstrand functional calculus ([25], [20]). To prove Gaussian fluctuation for an arbitrary function $f \in \mathcal{H}_s$ with $s > 4$, one has to strengthen (3.21) and use

V^.lMzK)) = O (<M»^) +0 ^(EHR.^f/^dlm.r^ 

(3.22) 
by repeating the steps of Proposition 3.2 in [40]. One then applies Proposition 2.2 
in [40] (see also Proposition 2 in [47] or Proposition 1 in [45]) to prove that 

Y((u l N J(X N )u p N )) < Const s l!£Jk (3.23) 

The Gaussian fluctuation then follows by a standard approximation argument as before. The estimate of the mathematical expectation of $\langle u_N^l, f(X_N) u_N^p \rangle$ follows by applying Theorem 1.4 and the Helffer-Sjöstrand functional calculus. Theorem 1.6 is proven. □



4. Proof of Theorem 1.3

Proof of Theorem 1.3. We begin with $|\theta_j| > \sigma$ an eigenvalue of $A_N$ with multiplicity $k_j$; the orthonormal eigenvectors of $A_N$ corresponding to $\theta_j$ are labeled $u_N^1, \ldots, u_N^{k_j}$. Following Theorem 1.5, $G_N^j(z)$ is the $k_j \times k_j$ matrix with entries

$$(G_N^j(z))_{pl} = \sqrt{N}\big(u_N^{p*} R_N(z) u_N^l - \mathbb{E}[u_N^{p*} R_N(z) u_N^l]\big)$$

By Proposition 1.1 the fluctuations of the eigenvalues can be expressed in terms of the fluctuations of the eigenvalues of $\Xi_N^j$. Then using the definition of $\Xi_N^j$, the estimate on $\sqrt{N}\big(\langle u_N^l, R_N(z) u_N^p \rangle - \langle u_N^l, \widehat{R}_N(z) u_N^p \rangle\big)$, where $\widehat{R}_N(z)$ denotes the resolvent of the modified matrix of Lemma 5.1, and Theorem 1.4 leads to:

ZJ m = VN((u l N> R N (z)u p N )-U lp ) 



= VN(((u l N ,R N {z)u p N ) - (u l N , Rn(z)u p n )) + («, Rn(z)u p n ) - E[«, R N (z)u" N )]) 
+ (E[(u l N ,R N (z)u p N )]-^S lp )) 

N((u l N , Rn(z)u p n ) - E[(i4, R N (z)u p N )}) + VN(E[(u l N ,R N (z)u p N )} - 1<J, P ) 



+ s/N((u l N , R N (z)u p N ) - «, R N (z)u p N )) 
= G J N {p 3 ) lm + J^<M 3 u™ + 0P (1), 



where $\sqrt{N}\big(\langle u_N^l, R_N(z) u_N^p \rangle - \langle u_N^l, \widehat{R}_N(z) u_N^p \rangle\big) = o_P(1)$ follows from Lemma 2.1 of [40] and Lemma 5.1 in the Appendix. By Theorem 1.5, $G_N^j(z)$ converges weakly in finite dimensional distributions to the $k_j \times k_j$ matrix valued random field $\Gamma(z)$ with independent centered entries that are Gaussian. Since the difference between $G_N^j(z)$ and $\Xi_N^j$ converges to a constant in probability, $\Xi_N^j$ converges in finite dimensional distributions to $\Gamma(z)$ plus that constant. All that is left to check is that the entries have the announced variance.
First note that:

n(Pi,Pi) = (g(Pj) 2 - g'{Pi)) = f-4 ■>- — 



e 2 -a 

2 +a 2 -6 2 ^ f - 2 



2 (9 2 -a 2 )J \6 2 {6 2 -a 2 ) 
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2:; 



So we conclude: 



lim E[Re(r lp ( Zl ))Re(T lp (z 2 ))} 

iv— >-oc 



(V+iKp + i) 



9 2 (6 2 -a 2 ] 



N 



lim E[Re(r /p (z 1 ))Im(r ;p (z 2 ))]=0 



lim E[Im(r /p (z 1 ))Im(r ;p (z 2 ))] = 

N— »CO 



Multiplying by (c e / ff ;(p)) 2 = 



(l-^)(l-p) 



<9 2 ( 



— a 2 ) 2 gives the desired variance. 



□ 



5. Appendix 



Proof of Lemma 2.1. We will prove the first estimate; at the end of the proof we note that the other two estimates can be proved similarly. We apply the resolvent identity (2.7) and the decoupling formula (2.8) to $\sum_k \mathbb{E}[R_{12} R_{1k} R_{kk}]$ and $\sum_k \mathbb{E}[R_{12}]\mathbb{E}[R_{1k} R_{kk}]$. After estimating the error terms we study the difference of $\sum_k \mathbb{E}[R_{12} R_{1k} R_{kk}]$ and $\sum_k \mathbb{E}[R_{12}]\mathbb{E}[R_{1k} R_{kk}]$.

$$z \sum_{k} \mathbb{E}[R_{12} R_{1k} R_{kk}] = \mathbb{E}\Big[\sum_{l,k} X_{1l} R_{l2} R_{1k} R_{kk}\Big] \qquad (5.1)$$



In the decoupling formula, the second cumulant term is



2 
"77 2_^ ^[((Rjv)l2 + Tr(Rjv)i?12)-Rlfc-Rfcfc + (Rjv)2fc-Rll-Rfcfc + (R-Ar)l2-Rl/c-Rfcfc 



+" (R-Af)2fc-Rlfe-Rfel + (R-Jv)2fc-^l&-#lfe] 

A 



^ J2 E[Tr(R N )R l2 R lk R kk ] + O 0-^- 



ATi/2 



We used (|2.6[) to estimate the error. 

Each of the third cumulant terms will have three $l$'s and three $k$'s in the matrix subscripts. For example:



Y/2 /_^ K 3,uRiiE[RiiRi2RikRkk] — — ^jE[^ ^nz^uRiiRiiRn) Ri k R kk ]. 

All the 



_/V3/2 ^l^ ,, 'J,i^ , -" J "L->-ii--^--iK--KKJ - N3 / 

k.l I k 

Using (|2.6j) the absolute value of this term is bounded by O ( —^ij- 
third cumulant terms can be bounded in the same manner except: 



Yn 2_^ K 3,llRllRl2RllRlkRkk 



A 3 / 



k.l 



< 



K p p r(^)r r(-D) 
AT3/2 12 X1 -W ±1 W-tljv 



^ -^|imW|- a l|Rlf ) lll|R^IIIIR^ ) ll-o/ r l Im W 



■^3/2 " V~>M M—yV MM"iVMM— JV H ^^ jy-l/2 

where R^ is the N-dimensional vector with R t = Rj. 
The fourth cumulant terms are of the form 



— 2 ^_^ K4 } uRk a R**R**R**R**R* 



l.k 



A|Im(z)|- 5 ^ |o n /|Im(z 



ATi/2 
where $a \in \{1, 2, l\}$. A similar argument works for the fifth cumulant terms, using that $\kappa_{5,1l} \le \mathrm{const}\, N^{1/4}$.

Finally the truncation term is the sum of $N^2$ terms, each bounded by $|\mathrm{Im}(z)|^{-8} N^{-5/2}$.
Applying the above estimates to (5.1) leads to:



z^E[R 12 R lk R kk }^a 2 Y,n^N(R N )R 12 Ri k R kk }+o(^^0-^ 

k k ^ 

(5.2) 
Now we apply the resolvent identity (2.7) to the other term of interest.

$$z \sum_{k} \mathbb{E}[R_{12}]\, \mathbb{E}[R_{1k} R_{kk}] = \sum_{k} \mathbb{E}[R_{12}]\, \mathbb{E}\Big[R_{kk} \delta_{k1} + \sum_{l} X_{1l} R_{lk} R_{kk}\Big]$$

Before applying the decoupling formula note that by (2.2)

\mi2]mii]\ < mh f rl) 

The second cumulant term is: 

2 
"77 2_^ ^[Rl2]^[(RllRlk + RllR\k)Rkk + Rlk{Rk\Rlk + RklRlk)] 
l,k 

= 77 E E [ i? ^]E[Tr(R Af )i? lfc i? fefe ] + O C^j" 4 
The higher order terms can be bounded as before leading to: 



z 



EE[i? 12 ]E[i? lfe i? fefe ] = a 2 ^E[i? 12 ]E[tr JV (Riv)i?ife J Rfefe] + O . y) , 



(5.3) 



Taking the difference of (5.2) and (5.3) and subtracting $\sum_k \sigma^2 \mathbb{E}[\mathrm{tr}_N(R_N)]\, \mathbb{E}[R_{1k} R_{kk} (R_{12} - \mathbb{E}[R_{12}])]$ from both sides of the equation leads to:

(z - a 2 E[tr N (R N )}) ^ E[R lk R kk (R 12 - E[R 12 ])] 

k 

= a 2 J~j E[tr N (R N )R 12 R lk R kk ] - a 2 ^ E[R 12 ]E[tr N (IL N )R lk R kk } 

k k 

a 2 E[tT N (K N )]J2nRikRkk(Ri2 - E[R 12 ])] + O ( '^ T "" : " 



TVi/2 

k x 

= a 2 ^E[(tr J v(R JV ) - E[tr A ,(R JV )])i? lfe i? fefc (i? 12 - E[i? 12 ])] + O ( ^ ( ' ^ff ' ' } 

Therefore, 

\(z - a 2 E[tr N {R N )})Y / HRi2(RikRkk -nRikRkk})}\ 

k 

'PsdM^I" 1 ) 



< a 2 2 | Im(z)|- 2 V(i? 12 ) 1 / 2 V(tr Af (R JV )) 1 / 2 + O 



ATi/2 
which implies 



2_^E[i?i2(i?ifei?fcfc — E[Ri k R kk ] 



< 



ATl/2 



2 



where we use the bound $|(z - \sigma^2 \mathbb{E}[\mathrm{tr}_N(R_N)])^{-1}| \le |\mathrm{Im}(z)|^{-1}$.

The second estimate is proved in the same manner, beginning with the resolvent identity (2.7), $z R_{k2} = \delta_{k2} + \sum_l R_{kl} X_{l2}$.

Similarly, to study J^k ^[R2kRi k ] we apply the resolvent identity (|2.7j) z J2k RzkR\ k 
R 2 i2+J2i,k X^RikRlk and to study £ fc E[R 2k ]E[R 2 lk ] we apply z £ fe E[R 2k ]E[R 2 lk ] = 
E[i? 2 fc]EE, fe X u R lk R lk ] + E[±R 2 i}E[R n ] and proceed as in the proof of (J2~TU|) 

' " D 

5.1. Removal of Diagonal Terms. In this section we truncate the entries of $W_N$ and remove the diagonal elements that are not relevant in the limiting distribution of $u_N^* R_N(z) v_N$. In this procedure we will consider arbitrary unit vectors $u_N$ and $v_N$ in order to change the assumption of five finite moments to the optimal four finite moments in [44]. Because conjugating a Wigner matrix by a permutation matrix gives a Wigner matrix, we can without loss of generality assume that there exists some finite $m$ such that the entries of $u_N$ and $v_N$ that do not go to zero are in the first $m < \infty$ coordinates for all $N$. In what follows we will set all the diagonal elements $W_{ii}$ for $i > m$ equal to zero.

We note that the results of [44] show that the diagonal entries of $W_N$ cannot be removed without affecting the limiting distribution if the $\|\cdot\|_\infty$ norms of $u_N$ and $v_N$ do not go to zero.

We begin by noting that Lemma 2.1 of [10] allows us to replace $W_N$ with another Wigner matrix whose first 2 moments match but whose entries are bounded by $\epsilon_N \sqrt{N}$ for some $\epsilon_N$ that goes to 0. For the remainder of the section we assume that we are working with the new matrix.

Lemma 5.1. Let $X_N = \frac{1}{\sqrt{N}} W_N$ be a random real symmetric (Hermitian) Wigner matrix defined in (1.1)-(1.6) (respectively (1.8)-(1.11)). Let $u_N, v_N$ be a sequence of orthogonal $N$-dimensional unit vectors such that $(u_N)_i, (v_N)_i \to 0$ for all $i > m$ and some $m < \infty$. Let $\mathrm{diag}_m(X_N)$ be the diagonal matrix such that $\mathrm{diag}_m(X_N)_{ii} = (X_N)_{ii}$ for $i > m$ and $0$ otherwise, and let $\widehat{R}_N(z) := (z I_N - (X_N - \mathrm{diag}_m(X_N)))^{-1}$. Then

$$\sqrt{N}\big(u_N^* R_N(z) v_N - u_N^* \widehat{R}_N(z) v_N\big) \to_P 0.$$

Proof. Let $\epsilon > 0$. We begin by noting that by Proposition 2.1 of [40] the event $\Omega := \{X_N : \|X_N\| \le 2\sigma + \epsilon\}$ has measure going to one as $N \to \infty$. So it is sufficient to prove convergence on this event.
Then using the resolvent identity

$$\sqrt{N}\big(u_N^* R_N(z) v_N - u_N^* \widehat{R}_N(z) v_N\big) = \sqrt{N}\big(u_N^* R_N(z)\, \mathrm{diag}_m(X_N)\, \widehat{R}_N(z) v_N\big) = \sum_{i>m} (u_N^* R_N(z))_i\, W_{ii}\, (\widehat{R}_N(z) v_N)_i$$



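Let $\Omega' := \{X_N : \|X_N - \mathrm{diag}_m(X_N)\| \le 2\sigma + \epsilon + \epsilon_N\}$ and note that $\Omega \subset \Omega'$. The set $\Omega'$ is useful because on it the resolvent of $X_N - \mathrm{diag}_m(X_N)$ at real $x > 2\sigma + \epsilon + \epsilon_N$ is bounded by $(x - (2\sigma + \epsilon + \epsilon_N))^{-1}$.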
We now show the above term goes to zero in $L^2$





2 








), 


1{0} 


<E 


VjV 


), 


2 


2'} 





£ (u^R(*0) W"«(r(*)vjv 



2 (u^R(z))^(R(z)v^ 

£>77i 

= 5]E |(u^R(*))^(r(z)a 

i>77l L 

53 E U(u* N R(z))Wu(R(z)v N )^ ((v* n R(z)) W n (R(z)n N ) ) l {n , } 



1{0'} 



(5.4) 



i^j>m 



Let R$°(z) = (zljv - (Xjv - eiJfoeJ))- 1 . Applying {23} gives 



(5.5) 



R N (z)e l = (zI N - (Xn - eiXue'i) - e l X il e' i ) e* 
^(^(l-A^R^ 



= R^ ) (z)e,(l-^(e^ ) (z)e J ))- 1 



Let $\beta_i := (1 - X_{ii}(e_i' R^{(ii)}(z) e_i))^{-1}$.
We bound (5.4) by:

53E[|(u^R(^)) i W ii (R(^)v JV )i| 2 l { ^ } ] 

= 53 E[|(u^R(" ) (z)) 4 W 4l (R(z)v A r) l /3 4 | 2 l { ^ } ] 

= 53 E^]E[|KR('«)( z )),(R( z K) l | 2 l {ff) ] 

2>m 

- E[|(u^R<*H*))<W<(R(*)vjv)i| a (l ~ lft| 2 )l { n'} 
The first part of this equation (5.6) is bounded by

53E[W i 2 ]E[|(u^R(")(z)) i (R(z)v Ar ) J | 2 l {n , } ] 



(5.6) 
(5.7) 



< maxE[^ 2 ] 53 53 E[\(u* N R^(z)) t R lk v k \ 2 l {n , } } + J^ E[|(u^R^(z)) l i? lfc « fc | 2 l {0 , } ] 



z>m \k>ni 



k<m 



< C maxM 2 53 E[\(u%R^\z)I m+1 K(z)) k \ 2 l {n , } }+CmmaxE[max\R lk v k \ 2 l {in } 

\ k>m * — ' ' '-' '" 



k<m i>m 



The first term of this bound converges to zero because $v_k \to 0$ for all $k > m$. The second term converges to zero because $\mathbb{E}[\max_{i>m} |R_{ik}| 1_{\Omega'}]$ converges to zero. Indeed, we start from the following identity for an off-diagonal resolvent entry (see for example [25])

$$R_{ik} = R_{ii} R^{(i)}_{kk} \big(X_{ik} + w_i^* R^{(ik)} w_k\big)$$

Let $\epsilon > 0$. Using $\mathbb{E}[w_i^* R^{(ik)} w_k] = 0$ and Markov's inequality gives that

P (K ft,« wtl > 9 < HK^tQ < jsl 
then using the crude bound on $R_{ii}$

F{max i>m \Rik\ > e) < P(max l>m C\X lk + w*R (lfc) w fc | > e) 

< C{¥{max i>m C\X ik \ > e) + AP(C*|w*R( lfe ) Wfe | > e)) 
-4-0. 

Along with the fact that $|R_{ik}(z)| 1_{\Omega'}$ is bounded, this shows that $\mathbb{E}[\max_{i>m} |R_{ik}| 1_{\Omega'}]$ goes to zero and hence (5.4) goes to zero as $N \to \infty$.
The second term (5.7) converges to zero because:

v. p(«) / -, 

(i - A) i {no = 7;,; i w < ^ (5.8) 

1-X^.RV. ; (z) 

and the sum over the rest of the terms is bounded. 

To bound the cross terms in (|5.5p . we begin by defining R^' n '{z) := (zIn — 

(X N - e^XuBi - e'jXjje,))- 1 and fo := (1 - X^Rp^ (z)e j ))~ 1 . Applying 
(|3.6[) twice leads to 

R N (z)ei = R ( ^\z)e l /3 i 



ft (llj^Me, X jj R^\z)^e j Rp jj) 



(z)e l (3 J , l j 
Applying this expansion to (|5.5|) : 
J2 E[((u^R(z)) l ^(R(z)v JV ) l )((v^R(z)) J W r JJ (R(z)u Ar ),)l {0 , } ] 

= ]T E[ft (u^R^' w) (z)e, - X jj u* N Rp' jj) {z)e j e' j Rp jj) {z)e i /3 j ,^ W a (R(z)\ N )i 



(jy%R(z)) j W jj (B.M(z)u N ) j ) l {n , } ] 

ft ( (u* N T4 i '" ) (zj) i W u (R(z)vn) Xv^R(z)),^(Rto)(z)u Ar ), 

(u^R^^ } (z)) i?(f^ ) (z)ft, J ^(R(z)v JV ) l (v$,R w (Z)) . W^R^^u^ J 1 {IV} 









The first term is estimated by: 



J2 E [ft (v^Rp^ (z)) . W u (R(z)v N ) i (v* N R% j) (z)) Wa (r(z)u n ) l {a > } 



i^j>m 



= Y, nW 33 ]E (u%Rp jj \z))w u (R(z)v N ) i (v%R% j \z)) (r(z)u n 



i^j>7n 



1 {n>} 



. to) , 



J2 E (1-ft) (u^R^C^j.Wii^Wviv). (v^R^'(2)J Wj^R^ujvJ 1^,, 



The first term is zero because $\mathbb{E}[W_{jj}] = 0$ and the second converges to zero using
The second term is estimated by 

Y E fox,, (xi%Kf"\z))Rf"\z)ll i , J Wu&{z)v N ) i (v^%))^(R(z)u^-l m 

-L Y PiW a (R(z)v N )i Y W n (u^R# J ' j) (z)) . Rf"\z)^,i (v^%)) W n (K(z)n N ),l {n , } 



E 



j^i>m 



Then we proceed as before, using (5.8) to replace $\beta_i$ with 1 and then using the independence of $W_{ii}$ and the fact that its expectation is 0.